Instance-Dependent ℓ∞-Bounds for Policy Evaluation in Tabular Reinforcement Learning
Document Details
- Document Type
- Pub Defense Publication
- Publication Date
- Jan 01, 2021
- Source ID
- 10.1109/tit.2020.3027316
Entities
People
- Ashwin Pananjady
- Martin J. Wainwright
Organizations
- National Science Foundation
- Office of Naval Research