Boosting Information Fusion

Abstract

Ensemble methods provide a principled framework for building high performance classifiers and representing many types of data. As a result, these methods can be useful for making inferences in many domains such as classification and multi-modal biometrics. We introduce a novel ensemble method for combining multiple representations (or views). The method is a multiple view generalization of AdaBoost. Similar to AdaBoost, base classifiers are independently built from each representation. Unlike AdaBoost, however, all data types share the same sampling distribution as the view whose weighted training error is the smallest among all the views. As a result, the most consistent data type dominates over time, thereby significantly reducing sensitivity to noise. In addition, our proposal is provably better than AdaBoost trained on any single type of data. The proposed method is applied to the problems of facial and gender prediction based on biometric traits as well as of protein classification. Experimental results show that our method outperforms several competing techniques including kernel-based data fusion.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jul 01, 2010
Accession Number
ADA564315

Entities

People

  • Costin Barbu
  • Guna Seetharaman
  • Jing Peng

Organizations

  • Massachusetts Institute of Technology

Tags

Communities of Interest

  • Materials and Manufacturing Processes
  • Sensors

DTIC Thesaurus Topics

  • Accuracy
  • Air Force Research Laboratories
  • Algorithms
  • Classification
  • Computer Science
  • Data Fusion
  • Data Sets
  • Detection
  • Factor Analysis
  • Information Science
  • Kernel Functions
  • Machine Learning
  • Military Research
  • Pattern Recognition
  • Probability
  • Sampling
  • Training

Fields of Study

  • Computer science

Readers

  • Computer Vision.
  • Distributed Systems and Data Platform Development

Technology Areas

  • AI & ML
  • AI & ML - Neural Networks