Stochastic Approximations for Finite-State Markov Chains

Abstract

In constrained Markov decision problems, optimal policies are often found to depend on quantities that are not readily available due to either insufficient knowledge of the model parameters or computational difficulties. This motivates the on-line estimation (or computation) problem investigated in this paper in the context of a single parameter family of finite-state Markov chains. The computation is implemented through an algorithm of the Stochastic Approximations type, which recursively generates on-line estimates for the unknown value. A useful methodology is outlined for investigating the strong consistency of the algorithm, and the proof is carried out under a set of simplifying assumptions to illustrate the key ideas unencumbered with technical details. An application to constrained Markov decision processes is briefly discussed.

Open PDF

Document Details

Document Type: Technical Report
Publication Date: Jan 01, 1987
Accession Number: ADA452264

Entities

People

A. M. Makowski
A. Shwartz
D-j. Ma

Organizations

University of Maryland

Stochastic Approximations for Finite-State Markov Chains

Abstract

Document Details

Entities

People

Organizations

Tags

Communities of Interest

DTIC Thesaurus Topics

Fields of Study

Readers