Weak Human Preference Supervision for Deep Reinforcement Learning
Document Details
- Document Type
- Pub Defense Publication
- Publication Date
- Dec 01, 2021
- Source ID
- 10.1109/tnnls.2021.3084198
Entities
People
- Chin-Teng Lin
- Kaichiu Wong
- Zehong Cao
Organizations
- Australian Research Council
- Office of Naval Research Global
- University of Tasmania