Computing Approximate Solutions to Markov Renewal Programs with Continuous State Spaces

Abstract

Value iteration and policy iteration are two well known computational methods for solving Markov renewal decision processes. Value iteration converges linearly, while policy iteration (typically) converges quadratically and is therefore more attractive in principle. However, when the state space is very large (or continuous), the latter asks for solving at each iteration a large linear system (or integral equation) and becomes unpractical. We propose an approximate policy iteration method, targeted especially to systems with continuous or large state spaces, for which the Bellman (expected cost-to-go) function is relatively smooth (or piecewise smooth). These systems occur quite frequently in practice. The method is based on an approximation of the Bellman function by a linear combination of an a priori fixed set of base functions. At each policy iteration, we build a linear system in terms of the coefficients of these base functions, and solve this system approximately. We give special attention to a particular case of finite element approximation where the Bellman function is expressed directly as a convex combination of its values at a finite set of grid points.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Sep 01, 1989
Accession Number
ADA243074

Entities

People

  • Pierre L'ecuyer

Organizations

  • Laval University

Tags

Communities of Interest

  • C4I
  • Counter WMD
  • Human Systems
  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Algorithms
  • Analytic Functions
  • Coefficients
  • Computer Programming
  • Convergence
  • Differential Equations
  • Dynamic Programming
  • Eigenvalues
  • Equations
  • Finite Element Analysis
  • Integrals
  • Linear Programming
  • Linear Systems
  • Markov Chains
  • Partial Differential Equations
  • Probability
  • Stochastic Processes

Fields of Study

  • Mathematics

Readers

  • Adaptive Control and Estimation with Uncertainty in Dynamic Systems.
  • Approximation Theory.
  • Mathematical Modeling and Probability Theory.

Technology Areas

  • Space