Automatic Typeset Input Techniques Evaluation.

Abstract

VITIES OF THE Air Force Foreign Technology Division Machine Translation Facilities. This effort was to evaluate existing optical character recognition capabilities toward the total requirement of a Russian Typeset Print Reader. In this research, four pages (4) of original scientific Russian text were used as the data base. The contractor demonstrated that the scanning and conversion of Russian text by OCR is feasible and potentially economical. In summary the results of the analysis of scanning the four samples indicates that specializing the software towards this particular application and providing methods to enable automatic format and font recognition will be most effective in reducing the total error rate. An ultimate total error rate of less than .5% appears achievable.

Document Details

Document Type
Technical Report
Publication Date
Feb 01, 1972
Accession Number
AD0755949

Entities

People

  • Daniel M. Forsyth

Tags

DTIC Thesaurus Topics

  • Air Force
  • Automatic
  • Buildings And Structures
  • Character Recognition
  • Contractors
  • Conversion
  • Databases
  • Foreign Technology
  • Identification
  • Machine Translation
  • Optical Character Recognition
  • Pattern Recognition
  • Personality
  • Recognition
  • Scanning
  • Test And Evaluation

Readers

  • Computational Linguistics
  • Speech Processing/Speech Recognition.
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation