Picture for Calarina Muslimani

Calarina Muslimani

The Trajectory Alignment Coefficient in Two Acts: From Reward Tuning to Reward Learning

Add code
Jan 23, 2026
Viaarxiv icon

Reward Learning through Ranking Mean Squared Error

Add code
Jan 15, 2026
Viaarxiv icon

Towards Improving Reward Design in RL: A Reward Alignment Metric for RL Practitioners

Add code
Mar 08, 2025
Viaarxiv icon

Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity

Add code
Jun 10, 2024
Figure 1 for Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity
Figure 2 for Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity
Figure 3 for Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity
Figure 4 for Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity
Viaarxiv icon

Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning

Add code
Apr 30, 2024
Viaarxiv icon

Reinforcement Teaching

Add code
Apr 25, 2022
Figure 1 for Reinforcement Teaching
Figure 2 for Reinforcement Teaching
Figure 3 for Reinforcement Teaching
Figure 4 for Reinforcement Teaching
Viaarxiv icon