Generalization through Minimal Networks with Application to Forecasting,

Abstract

Inspired by the information theoretic idea of minimum description length, we add a term to the usual back-propagation cost function that penalizes network complexity. From a Bayesian perspective, the complexity term can be usefully interpreted as an assumption about prior distribution of the weights. This method, called weight-elimination, is contrasted to ridge regression and to cross-validation. We apply weight-elimination to time series prediction. On the sunspot series, the network outperforms traditional statistical approaches and shows the same predictive power as multivariate adaptive regression splines.

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 1992
Accession Number
ADP007170

Entities

People

  • Andreas S. Weigend
  • David E. Rumelhart

Organizations

  • Stanford University

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Computer Science
  • Computing-Related Activities
  • Data Science
  • Delphi Method
  • Elimination
  • Engineering
  • Information Science
  • Interdisciplinary Science
  • Mathematics
  • Statistical Analysis
  • Statistics
  • Theoretical Computer Science
  • Validation

Fields of Study

  • Computer science

Readers

  • Neural Network Machine Learning.
  • Regression Analysis.
  • Statistical inference.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Neural Networks