Text Retrieval via Semantic Forests

Abstract

We approached our first participation in TREC with an interest in performing retrieval on the output of automatic speech-to-text (speech recognition) systems and a background in performing topic-labeling on such output. Our primary thrust, therefore, was to participate in the SDR track. In conformance with the rules, we also participated in the Ad Hoc text-retrieval task, to create a baseline for comparing our converted topic-labeling system with other approaches to IR and to assess the effect of speech-transcription errors. A second thrust was to explore rapid prototyping of an IR system, given the existing topic-labeling software. Our IR system makes use of software called Semantic Forests which is based on an algorithm originally developed for labeling topics in text and transcribed speech (Schone & Nelson, ICASSP 96). Topic-labelling is not an IR task, so Semantic Forests was adapted for use in TREC over an eight-week period for the Ad Hoc task, with an additional two weeks for SDR. In what follows, we describe our system as well as experiments, timings, results, and future directions with these techniques.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Nov 01, 1997
Accession Number
ADA470518

Entities

People

  • Calvin Olano
  • Jeffrey L. Townsend
  • Patrick Schone
  • Thomas H. Crystal

Organizations

  • United States Department of Defense

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Agreements
  • Algorithms
  • Automated Speech Recognition
  • Automatic
  • Computer Science
  • Data Sets
  • Databases
  • Department Of Defense
  • Dictionaries
  • Frequency
  • Information Operations
  • Lists (Data Structures)
  • Numbers
  • Software Prototyping
  • Test And Evaluation
  • Training

Fields of Study

  • Computer science

Readers

  • Forest Ecology
  • Information Retrieval
  • Speech Processing/Speech Recognition.

Technology Areas

  • AI & ML
  • AI & ML - Information Retrieval
  • AI & ML - Machine Translation
  • AI & ML - Neural Networks