Chris Lu

Discovering Preference Optimization Algorithms with and for Large Language Models

Jun 12, 2024
Via arXiv

Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning

Jun 01, 2024
Via arXiv

Revisiting Recurrent Reinforcement Learning with Memory Monoids

Feb 15, 2024
Via arXiv

Analysing the Sample Complexity of Opponent Shaping

Feb 08, 2024
Via arXiv

Discovering Temporally-Aware Reinforcement Learning Algorithms

Feb 08, 2024
Via arXiv

Meta-learning the mirror map in policy mirror descent

Feb 07, 2024
Via arXiv

Leading the Pack: N-player Opponent Shaping

Dec 26, 2023
Via arXiv

Scaling Opponent Shaping to High Dimensional Games

Dec 19, 2023
Via arXiv

JaxMARL: Multi-Agent RL Environments in JAX

Nov 20, 2023
Via arXiv

Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design

Oct 04, 2023
Via arXiv