Tech 60
- Llama Memo
- PSRO: Policy-Space Response Oracles
- Prompt Optimization
- Scaling Laws & GPT-3
- GPT-1 & GPT-2
- Transformer
- A Quick Guide to LLMs
- Bargaining
- Extensive-Form Games and Subgame Perfect Equilibrium
- Linear Algebra
- Mathematica Memos
- Building My Own PC
- TRPO Details
- MetaGrad in LIO
- Fairness Versus Reason in the Ultimatum Game
- Evolutionary Game Theory
- Code Visualization
- LyPythonToolbox
- Github Memo
- Python Project Template
- 多智能体强化学习中的信息设计
- Information Design in Multi-Agent Reinforcement Learning
- MacOS Workspace
- PyTorch Toolbox
- Python Toolbox
- Stable Baseline 3
- Overcooked: A MARL Task
- Tools of Visual Studio Code
- Policy Distillation
- HyperNetworks
- Decision Transformers
- Set
- Convergence Analysis of Gradient Descent
- Contraction Mapping Theorem
- A Note on Stochastic Processes
- Sequential Social Dilemma
- Zero-Determinant Strategy
- Classic Games
- Information Design in 10 Minutes
- A Memo on Game Theory
- Fictitious Self-Play and Zero-Shot Coordination
- Policy Gradient Details
- RNNs
- MARL Basics
- Computation Graph Visualization
- Dynamic Epistemic Logic
- Theory of Mind and Markov Models
- Theoretical Computer Science (TCS)
- Principal Component Analysis
- Information Design
- MARL Tasks
- RL Toolbox
- Misc Code Toolbox
- Math Toolbox
- Swinging Search and Crawling Control
- RHex-T3: A Mobile Robot, with Hybrid Leg Design
- MARL Seminar | Simultaneously Learning and Advising in MARL
- MARL Seminar | MADDPG
- MARL Seminar | CommNet
- MARL Seminar | Public Sanctions