RL 14
- TRPO Details
- 多智能体强化学习中的信息设计
- Information Design in Multi-Agent Reinforcement Learning
- Stable Baseline 3
- Overcooked: A MARL Task
- Fictitious Self-Play and Zero-Shot Coordination
- Policy Gradient Details
- MARL Basics
- Theory of Mind and Markov Models
- RL Toolbox
- MARL Seminar | Simultaneously Learning and Advising in MARL
- MARL Seminar | MADDPG
- MARL Seminar | CommNet
- MARL Seminar | Public Sanctions