Adaptive Policies for Markov Renewal Programs
Abstract
The paper recasts a class of infinite-state, infinite-action Markov renewal programs with unknown parameters as one-state programs with actions corresponding to stationary policies in the original program. Under suitable conditions, an adaptive (nonstationary) optimal policy is found in the sense of maximizing long-run expected reward per unit time.
Document Details
- Document Type
- Technical Report
- Publication Date
- Nov 01, 1971
- Accession Number
- AD0737320
Entities
People
- Bennett L. Fox
- John E. Rolph
Organizations
- RAND Corporation