Picture for Roman Plaud

Roman Plaud

Tailoring Strictly Proper Scoring Rules for Downstream Tasks: An Application to Causal Inference

Add code
Jun 02, 2026
Viaarxiv icon

Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning

Add code
Feb 10, 2025
Figure 1 for Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning
Figure 2 for Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning
Figure 3 for Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning
Figure 4 for Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning
Viaarxiv icon

Revisiting Hierarchical Text Classification: Inference and Metrics

Add code
Oct 02, 2024
Figure 1 for Revisiting Hierarchical Text Classification: Inference and Metrics
Figure 2 for Revisiting Hierarchical Text Classification: Inference and Metrics
Figure 3 for Revisiting Hierarchical Text Classification: Inference and Metrics
Figure 4 for Revisiting Hierarchical Text Classification: Inference and Metrics
Viaarxiv icon