Andrei Lupu

Jack

Ad-Hoc Human-AI Coordination Challenge

Jun 26, 2025

The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind

Jun 25, 2025

Adam on Local Time: Addressing Nonstationarity in RL with Relative Adam Timesteps

Dec 22, 2024

CURATe: Benchmarking Personalised Alignment of Conversational AI Assistants

Oct 28, 2024

The Llama 3 Herd of Models

Jul 31, 2024

Behaviour Distillation

Jun 21, 2024

Discovering Minimal Reinforcement Learning Environments

Jun 18, 2024

Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts

Feb 26, 2024

JaxMARL: Multi-Agent RL Environments in JAX

Nov 20, 2023

Grounding Aleatoric Uncertainty in Unsupervised Environment Design

Jul 11, 2022