AI 27
- PSRO: Policy-Space Response Oracles
- Prompt Optimization
- Scaling Laws & GPT-3
- GPT-1 & GPT-2
- Transformer
- A Quick Guide to LLMs
- TRPO Details
- MetaGrad in LIO
- Information Design in Multi-Agent Reinforcement Learning (Chinese version)
- Information Design in Multi-Agent Reinforcement Learning
- Stable-Baselines3
- Overcooked: A MARL Task
- Policy Distillation
- HyperNetworks
- Decision Transformers
- Sequential Social Dilemma
- Fictitious Self-Play and Zero-Shot Coordination
- Policy Gradient Details
- RNNs
- MARL Basics
- Theory of Mind and Markov Models
- MARL Tasks
- RL Toolbox
- MARL Seminar | Simultaneously Learning and Advising in MARL
- MARL Seminar | MADDPG
- MARL Seminar | CommNet
- MARL Seminar | Public Sanctions