On Combining Language Models: Oracle Approach

Abstract

In this paper, we address the of combining several language models (LMs). We find that simple interpolation methods, like log-linear and linear interpolation, improve the performance but fall short of the performance of an oracle. The oracle knows the reference word string and selects the word string with the best performance (typically, word or semantic error rate) from a list of word strings, where each word string has been obtained by using a different LM. Actually, the oracle acts like a dynamic combiner with hard decisions using the reference. We provide experimental results that clearly show the need for a dynamic language model combination to improve the performance further. We suggest a method that mimics the behavior of the oracle using a neural network or a decision tree. The method amounts to tagging LMs with confidence measures and picking the best hypothesis corresponding the LM with the best confidence.

Open PDF

Document Details

Document Type: Technical Report
Publication Date: Jan 01, 2001
Accession Number: ADA460991

Entities

People

Kadri Hacioglu
Wayne Ward

Organizations

University of Colorado Boulder

On Combining Language Models: Oracle Approach

Abstract

Document Details

Entities

People

Organizations

Tags

DTIC Thesaurus Topics

Fields of Study

Readers

Technology Areas