Divergence Measures Tool:An Introduction with Brief Tutorial

Abstract

This report provides new users of the Divergence Measures Tool (DMTool) with an overview of its core functionality. The DMTool calibrates natural language text corpora for how dissimilar they are from each other, based on the distribution of the relative frequencies of the terms in each corpus. Computation involves a suite of seven information-theoretic divergence measures calculated on given pairs of text files. Users are provided with the resulting scores and list views of the file terms with their frequencies, as used in computing the scores. Use cases and a hands-on tutorial are provided in this report.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Mar 01, 2014
Accession Number
ADA599155

Entities

People

  • Claire E. Jaja
  • Clare R. Voss
  • Douglas M. Briesch
  • Terrence J. Moore

Organizations

  • United States Army Research Laboratory

Tags

Communities of Interest

  • Biomedical
  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Application Software
  • Computational Linguistics
  • Computer Science
  • Computers
  • Frequency
  • Information Retrieval
  • Information Science
  • Language
  • Linguistics
  • Machine Translation
  • Mathematical Analysis
  • Military Research
  • Natural Language Processing
  • Natural Languages
  • Spreadsheet Software
  • Word Lists
  • Word Processors

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Database Systems and Applications
  • Regression Analysis.