Carlo Romeo

SOPE: Stabilizing Off-Policy Evaluation for Online RL with Prior Data

May 07, 2026
Via arXiv

NTRL: Encounter Generation via Reinforcement Learning for Dynamic Difficulty Adjustment in Dungeons and Dragons

Jun 24, 2025
Via arXiv

SPEQ: Stabilization Phases for Efficient Q-Learning in High Update-To-Data Ratio Reinforcement Learning

Jan 15, 2025
Via arXiv

Offline Reinforcement Learning with Imputed Rewards

Jul 15, 2024
Via arXiv