Picture for Eugene Tarassov

Eugene Tarassov

Offline Regularised Reinforcement Learning for Large Language Models Alignment

Add code
May 29, 2024
Viaarxiv icon

Understanding the performance gap between online and offline alignment algorithms

May 14, 2024
Viaarxiv icon

Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments

Sep 22, 2022
Viaarxiv icon

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning

Jun 30, 2022
Figure 1 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Figure 2 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Figure 3 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Figure 4 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Viaarxiv icon

Time-series Imputation of Temporally-occluded Multiagent Trajectories

Add code
Jun 08, 2021
Figure 1 for Time-series Imputation of Temporally-occluded Multiagent Trajectories
Figure 2 for Time-series Imputation of Temporally-occluded Multiagent Trajectories
Figure 3 for Time-series Imputation of Temporally-occluded Multiagent Trajectories
Figure 4 for Time-series Imputation of Temporally-occluded Multiagent Trajectories
Viaarxiv icon