Classifier Systems, Endogenous Fitness, and Delayed Rewards: A Preliminary Investigation

Abstract

Previous work has shown the potential advantages of using endogenous fitness schemes in classifier systems. The basic idea behind endogenous fitness is to reinforce successful system performance with "resources" that rules need in order to reproduce. Instead of storing explicit quantitative estimates of performance, each rule has one or more reservoirs that are used to store resources. When enough resources have been accumulated, a rule utilizes some of its resources to reproduce and the reservoir level is reduced accordingly. This paper extends this concept to accommodate environments having delayed rewards. Reinforcement learning techniques for solving average-reward Markovian decision processes are combined with a simple endogenous fitness scheme in a classifier system. We describe initial tests of this approach on state-space search problems used in previous classifier system studies.
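
The reservoir mechanism described in the abstract can be illustrated with a minimal Python sketch. It is not the report's implementation: the names Rule, reinforce, and maybe_reproduce, the single reservoir per rule, and the threshold and cost values are all hypothetical choices made for illustration, and the average-reward reinforcement learning component is not modeled here.

```python
from dataclasses import dataclass


@dataclass
class Rule:
    """A classifier rule with one resource reservoir (hypothetical layout)."""
    condition: str
    action: int
    reservoir: float = 0.0


def reinforce(rule: Rule, amount: float) -> None:
    """Add resources to the rule's reservoir after successful performance."""
    rule.reservoir += amount


def maybe_reproduce(rule: Rule, threshold: float = 1.0, cost: float = 1.0):
    """Return an offspring copy if enough resources have accumulated.

    The threshold and cost are assumed values, not taken from the report.
    Reproduction reduces the parent's reservoir by the cost.
    """
    if rule.reservoir < threshold:
        return None
    rule.reservoir -= cost
    return Rule(rule.condition, rule.action)  # offspring starts with an empty reservoir


# Usage: feed a rule a stream of rewards and collect any offspring.
rule = Rule(condition="1#0", action=1)
offspring = []
for reward in [0.5, 0.5, 0.5, 0.5, 0.5]:
    reinforce(rule, reward)
    child = maybe_reproduce(rule)
    if child is not None:
        offspring.append(child)
```

In this sketch no explicit fitness value is stored: how often a rule reproduces depends only on how quickly its reservoir refills, which is the sense in which fitness is endogenous.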

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2001
Accession Number
AD1125382

Entities

People

  • Lashon B. Booker

Organizations

  • MITRE Corporation

Tags

Communities of Interest

  • Human Systems

DTIC Thesaurus Topics

  • Algorithms
  • Artificial Intelligence
  • Computations
  • Data Science
  • Environment
  • Genetic Algorithms
  • Information Science
  • Learning
  • Machine Learning
  • Maintenance Costs
  • Order Statistics
  • Quality Control
  • Reinforcement Learning
  • Reservoirs
  • Sequences
  • Statistics

Fields of Study

  • Computer science

Readers

  • Mathematical Modeling and Probability Theory
  • Neural Network Machine Learning
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • Space