Picture for Pablo Samuel Castro

Pablo Samuel Castro

Stable Deep Reinforcement Learning via Isotropic Gaussian Representations

Add code
Feb 22, 2026
Viaarxiv icon

Discovering Differences in Strategic Behavior Between Humans and LLMs

Add code
Feb 10, 2026
Viaarxiv icon

A Comedy of Estimators: On KL Regularization in RL Training of LLMs

Add code
Dec 26, 2025
Viaarxiv icon

ARM-FM: Automated Reward Machines via Foundation Models for Compositional Reinforcement Learning

Add code
Oct 16, 2025
Viaarxiv icon

Asymmetric Proximal Policy Optimization: mini-critics boost LLM reasoning

Add code
Oct 02, 2025
Viaarxiv icon

Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning

Add code
Jun 18, 2025
Figure 1 for Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning
Figure 2 for Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning
Figure 3 for Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning
Figure 4 for Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning
Viaarxiv icon

Adaptive Accompaniment with ReaLchords

Add code
Jun 17, 2025
Viaarxiv icon

The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning

Add code
Jun 16, 2025
Figure 1 for The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning
Figure 2 for The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning
Figure 3 for The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning
Figure 4 for The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning
Viaarxiv icon

Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning

Add code
May 29, 2025
Viaarxiv icon

Mind the GAP! The Challenges of Scale in Pixel-based Deep Reinforcement Learning

Add code
May 23, 2025
Viaarxiv icon