A Connection between GRBF and MLP

Abstract

Both multilayer perceptrons (MLP) and Generalized Radial Basis Functions (GRBF) have good approximation properties, theoretically and experimentally. Are they related? The main point of this paper is to show that for normalized inputs, multilayer perceptron networks are radial function networks (albeit with a non-standard radial function). This provides an interpretation of the weights w as centers t of the radial function network, and therefore as equivalent to templates. This insight may be useful for practical applications, including better initialization procedures for MLP. In the remainder of the paper, we discuss the relation between the radial functions that correspond to the sigmoid for normalized inputs and well-behaved radial basis functions, such as the Gaussian. In particular, we observe that the radial function associated with the sigmoid is an activation function that is good approximation to Gaussian basis functions for a range of values of the bias parameter. The implication is that a MLP network can always simulate a Gaussian GRBF network (with the same number of units but less parameters); the converse is true only for certain values of the bias parameter. Numerical experiments indicate that this constraint is not always satisfied in practice by MLP networks trained with back propagation. Multiscale GRBF networks, on the other hand, can approximate MLP networks with a similar number of parameters.... Radial basis functions, Approximation theory Learning, Multi-layer perceptrons

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Sep 01, 1991
Accession Number
ADA259569

Entities

People

  • Federico Girosi
  • Minoru Maruyama
  • Tomaso Poggio

Organizations

  • Massachusetts Institute of Technology

Tags

Communities of Interest

  • Air Platforms

DTIC Thesaurus Topics

  • Abstracts
  • Artificial Intelligence
  • Coefficients
  • Cognitive Science
  • Department Of Defense
  • Equations
  • Exponential Functions
  • Fourier Series
  • Identities
  • Information Processing
  • Information Systems
  • Learning
  • Military Research
  • Standards
  • Template Patterns
  • Training
  • Two Dimensional

Fields of Study

  • Computer science

Readers

  • Calculus or Mathematical Analysis
  • Neural Network Machine Learning.