Eugene Tarassov

Offline Regularised Reinforcement Learning for Large Language Models Alignment

May 29, 2024

Understanding the performance gap between online and offline alignment algorithms

May 14, 2024

Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments

Sep 22, 2022

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning

Jun 30, 2022

Time-series Imputation of Temporally-occluded Multiagent Trajectories

Jun 08, 2021