Document Image Parsing and Understanding using Neuromorphic Architecture

Abstract

In this project, we investigate brain inspired information processing for text image recognition and its performance/accuracy optimization. The intelligent text recognition system (ITRS) works robustly on images with low quality by using a combination of input image data preparation, pattern matching and statistical inference. Our experimental results show that, compared to Tesseract, the ITRS achieves comparable accuracy for clean input images and higher accuracy for camera images with occlusions. Performance enhancement techniques are developed to reduce the processing speed at different layers. In the pattern matching layer, the computing power of multicore processors is explored to reduce the processing time. In the word confabulation layer, new data structures are adopted for the storage of a dictionary, which increases memory locality and reduces search complexity. In the sentence confabulation layer, different sentence models are compared and the best model with the highest accuracy is identified. Finally, the overall ITRS is implemented on a heterogeneous high performance computing cluster. It provides users with the flexibility of computing resource management through a configuration file.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Mar 01, 2015
Accession Number
ADA620044

Entities

People

  • Qinru Qiu

Organizations

  • Syracuse University

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Accuracy
  • Air Force
  • Air Force Research Laboratories
  • Artificial Intelligence Software
  • Bayesian Networks
  • Brain
  • Character Recognition
  • High Performance Computing
  • Image Processing
  • Image Recognition
  • Information Processing
  • Information Science
  • Neural Networks
  • Parallel Processing
  • Recognition
  • Statistical Inference
  • Supervised Machine Learning

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Neural Network Machine Learning.
  • Parallel and Distributed Computing.

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation