Learning Algorithms for the Multilayer Perceptron

Abstract

Central to the development of adaptive pattern processing algorithms (adaptive filters) for random problems - problems where statistics are unknown a priori and/or explicit rules governing behavior cannot be extracted in a reductionist manner - is the pursuit of adaptive architectures for associating arbitrary inputs to outputs. Such "associative memories" are important for providing the mathematical mapping (transfer function) relating inputs to outputs arising from implicit relationships found in a given training ensemble. The adoption of these filters or architectures during training is guided by a learning algorithm, mathematically derived from an objective function to ensure good association properties. The subject of this paper is an investigation of a class of learning algorithms for the highly parallel multilayer perception architecture used in an associative memory context. By controlling the scheduling of patterns presented during training, a generalized class of learning algorithms are shown to result. Specific realizations of the generalized algorithm include steepest descent (parameters adapted following presentation of all training patterns), Rumelhart back propagation (parameters adapted following presentation of each pattern), and a new algorithm which captures in part the benefits of both, less parameter adaption and faster convergence, by gradually varying the number of patterns presented per parameter adaption.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Oct 28, 1988
Accession Number
ADA202682

Entities

People

  • M. D. Eggers
  • T. S. Khuon

Organizations

  • Massachusetts Institute of Technology

Tags

Communities of Interest

  • Autonomy
  • Energy and Power Technologies
  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Abstracts
  • Adaptive Filters
  • Adaptive Systems
  • Artificial Intelligence
  • Computational Science
  • Computations
  • Computing System Architectures
  • Content Addressable Memory
  • Convergence
  • Filters
  • Information Science
  • Machine Learning
  • Neural Networks
  • Scheduling (Production)
  • Signal Processing
  • Statistics
  • Training

Readers

  • Calculus or Mathematical Analysis
  • Neural Network Machine Learning.
  • Systems Analysis and Design