Juan Claude Formanek

Self-Supervised On-Policy Reinforcement Learning via Contrastive Proximal Policy Optimisation

May 13, 2026
Via arXiv

CODA: Coordination via On-Policy Diffusion for Multi-Agent Offline Reinforcement Learning

Apr 25, 2026
Via arXiv