OPTIMAL CONTROL OF A DISCRETE TIME STOCHASTIC SYSTEM LINEAR IN THE STATE,

Abstract

Consideration is given to a discrete time finite horizon stochastic control problem whose dynamic equations and loss function are linear in the state vector with random coefficients, but which may vary in a nonlinear, random manner with the control variables. The controls are constrained to lie in a given set. For this system we have the rather surprising result that the optimal control or policy is independent of the value of the state. This fact follows from a simple dynamic programming argument concerning the form of the optimal return function. Under suitable restrictions on the functions, the dynamic programming approach leads to efficient computational methods for obtaining the controls via a sequence of mathematical programming problems in fewer variables than the number of controls in the entire process. The result extends the notion of certainty equivalence for a sequential stochastic decision problem. The expectations of the random functions play the role of certainty equivalents in the sense that the optimal control can be found by solving a deterministic problem in which expectations replace random quantities. (Author)

Document Details

Document Type
Technical Report
Publication Date
Apr 01, 1968
Accession Number
AD0668749

Entities

People

  • Joseph L. Midler

Organizations

  • RAND Corporation

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Coefficients
  • Computational Science
  • Computer Programming
  • Dynamic Programming
  • Equations
  • Mathematical Programming
  • Mathematics
  • Sequences
  • Stochastic Control

Fields of Study

  • Mathematics

Readers

  • Calculus or Mathematical Analysis
  • Operations Research

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Machine Learning Algorithms