Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence
Document Details
- Document Type
- Pub Defense Publication
- Publication Date
- Jun 22, 2023
- Source ID
- 10.1137/21m1456789
Entities
People
- Baihe Huang
- Jason D. Lee
- Shicong Cen
- Wenhao Zhan
- Yuejie Chi
- Yuxin Chen
Organizations
- Air Force Office of Scientific Research
- Alfred P. Sloan Foundation
- Army Research Office
- Carnegie Mellon University
- National Science Foundation
- Office of Naval Research
- Princeton University
- University of Pennsylvania