Learning Algorithms for the Multilayer Perceptron

Abstract

Central to the development of adaptive pattern processing algorithms (adaptive filters) for random problems - problems where statistics are unknown a priori and/or explicit rules governing behavior cannot be extracted in a reductionist manner - is the pursuit of adaptive architectures for associating arbitrary inputs to outputs. Such "associative memories" are important for providing the mathematical mapping (transfer function) relating inputs to outputs arising from implicit relationships found in a given training ensemble. The adoption of these filters or architectures during training is guided by a learning algorithm, mathematically derived from an objective function to ensure good association properties. The subject of this paper is an investigation of a class of learning algorithms for the highly parallel multilayer perception architecture used in an associative memory context. By controlling the scheduling of patterns presented during training, a generalized class of learning algorithms are shown to result. Specific realizations of the generalized algorithm include steepest descent (parameters adapted following presentation of all training patterns), Rumelhart back propagation (parameters adapted following presentation of each pattern), and a new algorithm which captures in part the benefits of both, less parameter adaption and faster convergence, by gradually varying the number of patterns presented per parameter adaption.

Open PDF

Document Details

Document Type: Technical Report
Publication Date: Oct 28, 1988
Accession Number: ADA202682

Entities

People

M. D. Eggers
T. S. Khuon

Organizations

Massachusetts Institute of Technology

Learning Algorithms for the Multilayer Perceptron

Abstract

Document Details

Entities

People

Organizations

Tags

Communities of Interest

DTIC Thesaurus Topics

Readers