Concurrent Learning of Control in Multi-agent Sequential Decision Tasks
Abstract
The overall objective of this project was to develop multi-agent reinforcement learning (MARL) approaches for intelligent agents to autonomously learn distributed control policies in decentralized partially observable Markov decision processes (Dec-POMDPs), without prior knowledge of the model parameters.
Document Details
- Document Type
- Technical Report
- Publication Date
- Apr 17, 2018
- Accession Number
- AD1053581
Entities
People
- Bikramjit Banerjee
Organizations
- University of Southern Mississippi