Jinglin Yang

P^2O: Joint Policy and Prompt Optimization

Mar 23, 2026
Via arXiv

Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards

Mar 10, 2026
Via arXiv

Data Contamination Report from the 2024 CONDA Shared Task

Jul 31, 2024
Via arXiv

Noise-Robust De-Duplication at Scale

Oct 09, 2022
Via arXiv