Picture for Chi Jin

Chi Jin

Towards Principled Superhuman AI for Multiplayer Symmetric Games

Add code
Jun 06, 2024
Figure 1 for Towards Principled Superhuman AI for Multiplayer Symmetric Games
Figure 2 for Towards Principled Superhuman AI for Multiplayer Symmetric Games
Viaarxiv icon

On Limitation of Transformer for Learning HMMs

Add code
Jun 06, 2024
Viaarxiv icon

FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning

Add code
Jun 04, 2024
Figure 1 for FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning
Figure 2 for FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning
Figure 3 for FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning
Figure 4 for FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning
Viaarxiv icon

Tuning-Free Stochastic Optimization

Add code
Feb 12, 2024
Viaarxiv icon

Maximum Likelihood Estimation is All You Need for Well-Specified Covariate Shift

Add code
Nov 27, 2023
Viaarxiv icon

ZeroSwap: Data-driven Optimal Market Making in DeFi

Add code
Oct 13, 2023
Figure 1 for ZeroSwap: Data-driven Optimal Market Making in DeFi
Figure 2 for ZeroSwap: Data-driven Optimal Market Making in DeFi
Figure 3 for ZeroSwap: Data-driven Optimal Market Making in DeFi
Figure 4 for ZeroSwap: Data-driven Optimal Market Making in DeFi
Viaarxiv icon

Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning

Add code
Sep 29, 2023
Figure 1 for Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning
Figure 2 for Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning
Figure 3 for Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning
Figure 4 for Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning
Viaarxiv icon

Is RLHF More Difficult than Standard RL?

Add code
Jun 25, 2023
Figure 1 for Is RLHF More Difficult than Standard RL?
Viaarxiv icon

Context-lumpable stochastic bandits

Add code
Jun 22, 2023
Viaarxiv icon

DoWG Unleashed: An Efficient Universal Parameter-Free Gradient Descent Method

Add code
May 25, 2023
Figure 1 for DoWG Unleashed: An Efficient Universal Parameter-Free Gradient Descent Method
Figure 2 for DoWG Unleashed: An Efficient Universal Parameter-Free Gradient Descent Method
Figure 3 for DoWG Unleashed: An Efficient Universal Parameter-Free Gradient Descent Method
Figure 4 for DoWG Unleashed: An Efficient Universal Parameter-Free Gradient Descent Method
Viaarxiv icon