Phrase-based Multimedia Information Extraction

Abstract

StreamSage proposed to develop a prototype software system that would specifically deal with the two primary challenges of speech data on the performance of information extraction: degraded input data and the time-based nature of the content. In order to overcome these two challenges, this effort focused on two general areas: mitigating the degraded quality of speech data and improving entity identification. Technologies developed under this project for audio/video named entity identification and end user access to relevant information could have tremendous value for both military and commercial entities.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jul 01, 2006
Accession Number
ADA456800

Entities

People

  • Eric Cohen
  • Evelyne Tzoukermann

Tags

Communities of Interest

  • Biomedical

DTIC Thesaurus Topics

  • Air Force Research Laboratories
  • Artificial Intelligence Software
  • Automata Theory
  • Automated Speech Recognition
  • Automated Text Summarization
  • Computational Linguistics
  • Computational Science
  • Computer Languages
  • Identification
  • Language
  • Linguistics
  • Machine Learning
  • Markov Models
  • Models
  • Named Entity Recognition
  • Natural Language Processing
  • Ontologies

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Geospatial Intelligence and Artificial Intelligence Analytics
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - DoD AI Strategy
  • AI & ML - Information Retrieval