Evidence for Increased Discriminability in Judging the Acceptability of Machine Translations: The Case for Magnitude Estimation

Abstract

An earlier experiment, in which magnitude estimation (ME) was used as the method for judging the acceptability of machine translations, was replicated with one exception. The current study used a four-point Likert scale as the measurement methodology. In the earlier work, using ME, judges easily discriminated between two sets of machine translations, one containing 25% more correctly translated names than the other. In this study, using a Likert scale, judges were not able to make the same discrimination. These results support the theory that ME may be a superior measurement methodology for assessing the acceptability of machine translations.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
May 01, 2009
Accession Number
ADA499858

Entities

People

  • James D. Walrath

Organizations

  • United States Army Research Laboratory

Tags

Communities of Interest

  • Weapons Technologies

DTIC Thesaurus Topics

  • Acceptability
  • Chi Square Test
  • Computational Science
  • Department Of Defense
  • Information Operations
  • Information Science
  • Instructions
  • Judgment
  • Language
  • Machine Translation
  • Measurement
  • Military Research
  • Pilot Studies
  • Psychology
  • Standards
  • Test Methods
  • Translations

Fields of Study

  • Psychology

Readers

  • Instructional Design and Training Evaluation.
  • Political Science/ International Relations/ European Studies
  • Psychometric Testing or Psychological Assessment.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Machine Translation