A Whole Word and Number Reading Machine Based on Two Dimensional Low Frequency Fourier Transforms

Abstract

The Fourier transform is investigated as a means for developing an optical reader capable of reading a large vocabulary without segmenting the image of a word into individual characters. The reader is capable of reading printed and cursive font styles, is scale invariant, and is substantially insensitive to noise. The image of a particular word is treated as a single symbol; the two dimensional low frequency Fourier coefficients (assuming n coefficients are calculated) define the word's location on an n dimensional hypersphere of unit radius. The distance between individual locations (words) categorizes similar and dissimilar words. The smaller the distance, the more similar images are. Multiple images of a word using various font styles form a unique cluster on the surface of the hypersphere. The distance between clusters (different words) is greater than the distance across a cluster (same word in different font styles). Therefore, by using the centroid of these clusters to build a library of words, input or test words match to the nearest cluster centroid using a minimum distance calculation. This algorithm is capable of correctly recognizing at least 5000 words using 24 various font styles (120,000 individual images).

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Dec 01, 1990
Accession Number
ADA229036

Entities

People

  • Mark A. O'hair

Organizations

  • Air Force Institute of Technology

Tags

Communities of Interest

  • Biomedical
  • Energy and Power Technologies
  • Ground and Sea Platforms
  • Human Systems

DTIC Thesaurus Topics

  • Abstracts
  • Agreements
  • Algorithms
  • Character Recognition
  • Cognition
  • Commerce
  • Computer Programs
  • Computers
  • Employment
  • Health Services
  • Identification
  • Law
  • Personnel Management
  • Psychology
  • Reading Machines
  • Recognition
  • Two Dimensional

Readers

  • Computer Programming and Software Development.
  • Computer Vision.
  • Educational Psychology