EVALUATION OF PRINT READER OUTPUT CORRECTION STUDY.

Abstract

This study has tested and evaluated a basic computer software model developed for assisting an optical character recognition unit in deciding character identity by context dependent factors. An operational environment textual data base was used for the study. It had errors introduced as a result of output from an optical character recognition device. Correction techniques used are based, not on full dictionary lookup, but on n-gram occurrence lists, common word dictionaries, environmental dictionaries, and character confusion tables. A conclusion of this evaluation is that the basic programmed model provides a reliable approach for correcting identified errors. It is flexible and accommodates adaptive or learning techniques and is most effective when information about the character in error is supplied by the device. (Author)

Document Details

Document Type
Technical Report
Publication Date
Jul 01, 1969
Accession Number
AD0692521

Entities

People

  • Arthur Mosher
  • Linda Strobel

Organizations

  • International Business Machines Corporation (Armonk, NY)

Tags

DTIC Thesaurus Topics

  • Character Recognition
  • Computer Programs
  • Computers
  • Databases
  • Dictionaries
  • Optical Character Recognition
  • Personality
  • Recognition
  • Test And Evaluation

Readers

  • Computational Linguistics
  • Computational Modeling and Simulation
  • Computer Vision.