Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence

Document Details

Document Type: Pub Defense Publication
Publication Date: Jun 22, 2023
Source ID: 10.1137/21m1456789

Entities

People

  • Baihe Huang
  • Jason D. Lee
  • Shicong Cen
  • Wenhao Zhan
  • Yuejie Chi
  • Yuxin Chen

Organizations

  • Air Force Office of Scientific Research
  • Alfred P. Sloan Foundation
  • Army Research Office
  • Carnegie Mellon University
  • Google
  • National Science Foundation
  • Office of Naval Research
  • Princeton University
  • University of Pennsylvania

Tags

Technology Areas

  • AI & ML
  • AI & ML - Machine Learning Algorithms