Value-Function Approximations for Partially Observable Markov Decision Processes

Abstract

Partially observable Markov decision processes (POMDPs) provide an elegant mathematical framework for modeling complex decision and planning problems in stochastic domains in which states of the system are observable only indirectly, via a set of imperfect or noisy observations. The modeling advantage of POMDPs, however, comes at a price-- exact methods for solving them are computationally very expensive and thus applicable in practice only to very simple problems. We focus on efficient approximation (heuristic) methods that attempt to alleviate the computational problem and trade off accuracy for speed.

Open PDF

Document Details

Document Type: Technical Report
Publication Date: Aug 01, 2000
Accession Number: ADA635868

Entities

People

Milos Hauskrecht

Organizations

Brown University

Value-Function Approximations for Partially Observable Markov Decision Processes

Abstract

Document Details

Entities

People

Organizations

Tags

Communities of Interest

DTIC Thesaurus Topics

Readers