Numerical Estimation of Information Theoretic Measures for Large Data Sets

Abstract

A problem that has plagued the tracking community for decades has been the lack of a single metric to assess the overall performance of tracking systems. The authors' prior research has identified total conditional entropy as a useful measure to assess the overall performance of multi-target trackers and classifiers. The measure can be used to evaluate any system that can be formulated as an assignment algorithm that maps N classes of objects to M labels. The assignments from the decision system are compared to a truth data set to generate the total conditional entropy. This report focuses on work to generate error estimates so that the statistical significance of test results can be determined. Derivations of Wolpert and Wolf provide exact equations for calculating the first- and second-order statistics for the three fundamental entropy measures from sample data. The authors have restructured Wolpert and Wolf's equations into computer code that produces stable numerical results up to sample sizes in the billions. This code is provided as a MATLAB software package available from MIT Lincoln Laboratory.
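As a rough illustration of the kind of quantity the abstract describes, the sketch below computes a plug-in (frequency-based) conditional entropy from a table of truth-versus-label counts. It is written in Python and is not the MATLAB package mentioned above. It also does not implement the Wolpert and Wolf estimators or their error estimates. The function names are hypothetical, and treating "total conditional entropy" as the sum of the two conditional entropies, H(label | truth) + H(truth | label), is an assumption made for this example rather than a definition taken from the report.

```python
import math


def conditional_entropy(counts):
    """Plug-in estimate of H(column | row) in bits.

    counts[i][j] is the number of samples whose row value is i and whose
    column value is j (for example, truth class i assigned to label j).
    """
    total = sum(sum(row) for row in counts)
    h = 0.0
    for row in counts:
        row_total = sum(row)
        for c in row:
            if c > 0:
                # p(i, j) * log2( p(i) / p(i, j) ), expressed with raw counts
                h += (c / total) * math.log2(row_total / c)
    return h


def transpose(counts):
    """Swap rows and columns of the count table."""
    return [list(col) for col in zip(*counts)]


def total_conditional_entropy(counts):
    """Assumed definition for this sketch: H(col | row) + H(row | col)."""
    return conditional_entropy(counts) + conditional_entropy(transpose(counts))


# Rows are truth classes, columns are assigned labels (hypothetical data).
perfect = [[5, 0], [0, 5]]    # every class maps to exactly one label
confused = [[1, 1], [1, 1]]   # labels carry no information about classes

print(total_conditional_entropy(perfect))   # 0.0
print(total_conditional_entropy(confused))  # 2.0
```

A value of zero corresponds to a one-to-one match between truth classes and labels; larger values indicate more confusion between them. Plug-in estimates of this kind are biased for small samples, which is the situation where sample-based first- and second-order statistics such as those described in the abstract become relevant.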

Document Details

Document Type
Technical Report
Publication Date
Jan 30, 2013
Accession Number
ADA580524

Entities

People

  • Edward K. Kao
  • Michael B. Hurley

Organizations

  • Massachusetts Institute of Technology

Tags

Communities of Interest

  • C4I
  • Energy and Power Technologies
  • Human Systems
  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Abstracts
  • Accuracy
  • Air Force
  • Algorithms
  • Communities
  • Computations
  • Computers
  • Data Science
  • Data Sets
  • Detectors
  • Equations
  • Information Science
  • Machine Learning
  • Order Statistics
  • Probability
  • Statistics
  • Two Dimensional

Readers

  • Finite Element Method (FEM) for Solving Partial Differential Equations (PDEs)
  • Sensor Fusion and Tracking Systems
  • Statistical Inference