Estimating from Misclassified Data

Abstract

A statistical method for estimating the proportions of items in each of several categories, based on an item-by-item classification in which many items may be misclassified. A specific case of interest is that in which the items are subjects being interviewed and the subjects may be hostile. Maximum likelihood estimators are developed for both the two-category and the multicategory response cases with respect to a group of noncooperative interviewees. An assessment is made, for each subject, of the probability that he is hostile. These probabilities are then combined with the actual responses to obtain the maximum likelihood estimators. Explicit evaluation of the estimators for a sample of n subjects and r categories requires solution of a simple concave programming problem involving a logarithmic objective function in variables confined to the unit interval. A Bayesian approach is used to evaluate the misclassification (or hostility) probabilities. It is assumed that the analysis applies to a single question only. For a survey containing many questions the estimators would be evaluated separately for each question. The analysis can also be generalized to consider many questions simultaneously.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jul 01, 1967
Accession Number
AD0655845

Entities

People

  • S. J. Press

Organizations

  • RAND Corporation

Tags

DTIC Thesaurus Topics

  • Applied Mathematics
  • Bayesian Networks
  • Computer Programming
  • Equations
  • Estimators
  • Human Population
  • Information Science
  • International Security
  • Maximum Likelihood Estimation
  • Probability
  • Probability Distributions
  • Random Variables
  • Statistical Algorithms
  • Statistical Inference
  • Statistics
  • Surveys
  • Test And Evaluation

Fields of Study

  • Mathematics

Readers

  • Gender and Food Studies
  • Life Cycle Cost Analysis
  • Neural Network Machine Learning.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference