Instance-Dependent ℓ∞-Bounds for Policy Evaluation in Tabular Reinforcement Learning

Document Details

Document Type: Pub Defense Publication
Publication Date: Jan 01, 2021
Source ID: 10.1109/tit.2020.3027316

Entities

People

  • Ashwin Pananjady
  • Martin J. Wainwright

Organizations

  • National Science Foundation
  • Office of Naval Research

Tags

Technology Areas

  • AI & ML
  • AI & ML - Machine Learning Algorithms