Value-Function Approximations for Partially Observable Markov Decision Processes

Abstract

Partially observable Markov decision processes (POMDPs) provide an elegant mathematical framework for modeling complex decision and planning problems in stochastic domains in which states of the system are observable only indirectly, via a set of imperfect or noisy observations. The modeling advantage of POMDPs, however, comes at a price-- exact methods for solving them are computationally very expensive and thus applicable in practice only to very simple problems. We focus on efficient approximation (heuristic) methods that attempt to alleviate the computational problem and trade off accuracy for speed.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Aug 01, 2000
Accession Number
ADA635868

Entities

People

  • Milos Hauskrecht

Organizations

  • Brown University

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Algorithms
  • Artificial Intelligence
  • Computational Science
  • Computer Programming
  • Computer Science
  • Extrapolation
  • Interpolation
  • Linear Programming
  • Lisp Programming Language
  • Motion Planning
  • Navigation
  • Optimization
  • Probability
  • Probability Distributions
  • Random Variables
  • Simulations

Readers

  • Adaptive Control and Estimation with Uncertainty in Dynamic Systems.