Robust Planning in Domains with Stochastic Outcomes, Adversaries, and Partial Observability

Abstract

Real-world planning problems often feature multiple sources of uncertainty including randomness in outcomes, the presence of adversarial agents and lack of complete knowledge of the world state. This thesis describes algorithms for four related formal models that can address multiple types of uncertainty Markov decision processes, MDPs with adversarial costs, extensive-form games, and a new class of games that includes both extensive-form games and MDPs as special cases. Markov decision processes can represent problems where actions have stochastic outcomes. We describe several new algorithms for MDPs, and then show how MDPs can be generalized to model the presence of an adversary that has some control over costs. Extensive-form games can model games with random events and partial observability. In the zero-sum perfect-recall case a minimax solution can be found in time polynomial in the size of the game tree. However, the game tree must "remember" all past actions and random outcomes, and so the size of the game tree grows exponentially in the length of the game. This thesis introduces a new generalization of extensive-form games that relaxes this need to remember all past actions exactly, producing exponentially smaller representations for interesting problems. Further, this formulation unifies extensive-form games with MDP planning. We present a new class of fast anytime algorithms for the off-line computation of minimax equilibria in both traditional and generalized extensive-form games. Experimental results demonstrate their effectiveness on an adversarial MDP problem and on a large abstracted poker game. We also present a new algorithm for playing repeated extensive-form games that can be used when only the total payoff of the game is observed on each round.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Dec 01, 2006
Accession Number
ADA537262

Entities

People

  • Hugh B. Mcmahan

Organizations

  • Carnegie Mellon University

Tags

Communities of Interest

  • Autonomy
  • C4I
  • Energy and Power Technologies
  • Ground and Sea Platforms
  • Human Systems
  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Algorithms
  • Artificial Intelligence
  • Computational Science
  • Computations
  • Computer Science
  • Convex Programming
  • Distance Learning
  • Game Theory
  • Linear Programming
  • Machine Learning
  • Mathematics
  • Matrix Games
  • Operations Research
  • Probabilistic Models
  • Probability Distributions
  • Random Variables
  • Zero-Sum Games

Fields of Study

  • Computer science

Readers

  • Adaptive Control and Estimation with Uncertainty in Dynamic Systems.
  • Game Theory.
  • Mathematical Modeling and Probability Theory.