Picture for Xiusi Chen

Xiusi Chen

May

Decoding the Critique Mechanism in Large Reasoning Models

Add code
Mar 17, 2026
Viaarxiv icon

How Far Can Unsupervised RLVR Scale LLM Training?

Add code
Mar 09, 2026
Viaarxiv icon

Why Reasoning Fails to Plan: A Planning-Centric Analysis of Long-Horizon Decision Making in LLM Agents

Add code
Jan 29, 2026
Viaarxiv icon

PEARL: Self-Evolving Assistant for Time Management with Reinforcement Learning

Add code
Jan 17, 2026
Viaarxiv icon

Current Agents Fail to Leverage World Model as Tool for Foresight

Add code
Jan 08, 2026
Viaarxiv icon

Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning

Add code
Oct 02, 2025
Figure 1 for Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
Figure 2 for Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
Figure 3 for Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
Figure 4 for Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
Viaarxiv icon

Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum

Add code
Oct 01, 2025
Viaarxiv icon

Perception-Aware Policy Optimization for Multimodal Reasoning

Add code
Jul 08, 2025
Figure 1 for Perception-Aware Policy Optimization for Multimodal Reasoning
Figure 2 for Perception-Aware Policy Optimization for Multimodal Reasoning
Figure 3 for Perception-Aware Policy Optimization for Multimodal Reasoning
Figure 4 for Perception-Aware Policy Optimization for Multimodal Reasoning
Viaarxiv icon

DecisionFlow: Advancing Large Language Model as Principled Decision Maker

Add code
May 27, 2025
Viaarxiv icon

Graph Foundation Models: A Comprehensive Survey

Add code
May 21, 2025
Viaarxiv icon