PLATO: Portable Language-Independent Adaptive Translation from OCR

Abstract

This is the second R&D quarterly progress report (QPR) of the BBN-led team under DARPA's MADCAT program. The goal for the pre-processing and image enhancement task is to eliminate noise artifacts from documents. In this reporting period, we performed preliminary experiments to assess the usefulness of shape-DNA enhancement on machine-print and handwritten images. The shape-DNA approach uses a database of low- and high-resolution shapes and a probabilistic shape-mapping model. The database and mapping are both automatically learned from training data to estimate high-resolution details from low-resolution shapes.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Mar 01, 2008
Accession Number
ADA480938

Entities

People

  • Prem Natarajan

Organizations

  • BBN Technologies

Tags

DTIC Thesaurus Topics

  • Accuracy
  • Computer Vision
  • Databases
  • Department Of Defense
  • Detection
  • Extraction
  • Feature Extraction
  • Governments
  • Graphical User Interface
  • High Resolution
  • Language
  • Low Resolution
  • Machine Learning
  • Recognition
  • Supervised Machine Learning
  • Test Sets
  • Training

Fields of Study

  • Computer science

Readers

  • Clinical Trial Research.
  • Computational Linguistics
  • Computer Vision.