Stochastic Approximations for Finite-State Markov Chains

Abstract

In constrained Markov decision problems, optimal policies are often found to depend on quantities that are not readily available due to either insufficient knowledge of the model parameters or computational difficulties. This motivates the on-line estimation (or computation) problem investigated in this paper in the context of a single parameter family of finite-state Markov chains. The computation is implemented through an algorithm of the Stochastic Approximations type, which recursively generates on-line estimates for the unknown value. A useful methodology is outlined for investigating the strong consistency of the algorithm, and the proof is carried out under a set of simplifying assumptions to illustrate the key ideas unencumbered with technical details. An application to constrained Markov decision processes is briefly discussed.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 1987
Accession Number
ADA452264

Entities

People

  • A. M. Makowski
  • A. Shwartz
  • D-j. Ma

Organizations

  • University of Maryland

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Abstracts
  • Algorithms
  • Computations
  • Consistency
  • Electrical Engineering
  • Engineering
  • Information Operations
  • Markov Chains
  • Mathematics
  • Standards
  • Universities

Fields of Study

  • Mathematics

Readers

  • Adaptive Control and Estimation with Uncertainty in Dynamic Systems.
  • Mathematical Modeling and Probability Theory.
  • Theoretical Analysis.