Embedded Techniques for High Content Analysis Feature Selection

Abstract

High content analysis (HCA) is a useful technique for extracting unprejudiced explanations for phenotypic responses. However, the massive number of features generated -- often exceeding the number of samples -- necessitates an intermediate feature selection step as part of an overall analytic pipeline. While typical feature selection techniques in the literature focus on more modest feature sizes p < 100, we found that our large feature sets diminished the feasibility of direct wrapper-based approaches, whereas filter-based approaches produced limited feature complementarity.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Oct 23, 2020
Accession Number
AD1123159

Entities

People

  • Daniel Cowan
  • Guanpeng A Xu
  • Heather A. Pangburn
  • Patrick M. Mclendon

Organizations

  • Massachusetts Institute of Technology

Tags

Communities of Interest

  • Human Systems

DTIC Thesaurus Topics

  • Abstracts
  • Accuracy
  • Air Force
  • Air Force Research Laboratories
  • Biotechnology
  • Breast Cancer
  • Clustering
  • Computer Science
  • Contracts
  • Data Analysis
  • Dimensionality Reduction
  • Feature Extraction
  • Feature Selection
  • Government Procurement
  • Governments
  • Information Exchange
  • Machine Learning
  • Motor Skills
  • Pattern Recognition
  • Pipelines
  • Signal Processing
  • Standards
  • Supervised Machine Learning
  • Technical Information Centers
  • United States

Fields of Study

  • Computer science

Readers

  • Operations Research
  • Speech Processing/Speech Recognition.
  • Systems Analysis and Design