Xi Chen

M-PSI

On the Limitations of Rank-One Model Editing in Answering Multi-hop Questions

Jan 08, 2026
Via arXiv

All That Glisters Is Not Gold: A Benchmark for Reference-Free Counterfactual Financial Misinformation Detection

Jan 08, 2026
Via arXiv

Learning to Diagnose and Correct Moral Errors: Towards Enhancing Moral Sensitivity in Large Language Models

Jan 06, 2026
Via arXiv

GDRO: Group-level Reward Post-training Suitable for Diffusion Models

Jan 05, 2026
Via arXiv

Evaluating the Diagnostic Classification Ability of Multimodal Large Language Models: Insights from the Osteoarthritis Initiative

Jan 05, 2026
Via arXiv

Step-DeepResearch Technical Report

Dec 24, 2025
Via arXiv

Exploration vs Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward

Dec 21, 2025
Via arXiv

Video Detective: Seek Critical Clues Recurrently to Answer Question from Long Videos

Dec 19, 2025
Via arXiv

Spectro-temporal unitary transformations for coherent modulation: design trade-offs and practical considerations

Dec 19, 2025
Via arXiv

Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection

Dec 18, 2025
Via arXiv