COMPUTATIONAL EFFICIENCY IN THE SELECTION OF REGRESSION VARIABLES.

Abstract

A number of criteria have been proposed for selecting the best subset or subsets of independent variables in linear regression analysis. Applying these criteria to all possible subsets is, in general, infeasible if the number of candidate variables in large. Since most criteria are monotone functions of the residual sum of squares, the problem is reduced to identifying subsets for which this quantity is small. In the report a method is described which will identify best subsets while considering only a small fraction of the possible subsets. The method is based on a branch and bound technique and will identify the best subset of each size and has the added feature that a number of nearly best subsets are also revealed.

Document Details

Document Type
Technical Report
Publication Date
Jul 01, 1969
Accession Number
AD0691216

Entities

People

  • L. R. Lamotte
  • R. R. Hocking

Organizations

  • Texas A&M University

Tags

DTIC Thesaurus Topics

  • Computing-Related Activities
  • Data Science
  • Efficiency
  • Information Science
  • Interdisciplinary Science
  • Linear Regression Analysis
  • Mathematical Analysis
  • Mathematics
  • Monotone Functions
  • Regression Analysis
  • Residuals
  • Statistical Analysis

Fields of Study

  • Mathematics

Readers

  • Statistical inference.
  • Systems Analysis and Design