Picture for Xiusi Chen

Xiusi Chen

May

Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning

Add code
Oct 02, 2025
Viaarxiv icon

Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum

Add code
Oct 01, 2025
Viaarxiv icon

Perception-Aware Policy Optimization for Multimodal Reasoning

Add code
Jul 08, 2025
Figure 1 for Perception-Aware Policy Optimization for Multimodal Reasoning
Figure 2 for Perception-Aware Policy Optimization for Multimodal Reasoning
Figure 3 for Perception-Aware Policy Optimization for Multimodal Reasoning
Figure 4 for Perception-Aware Policy Optimization for Multimodal Reasoning
Viaarxiv icon

DecisionFlow: Advancing Large Language Model as Principled Decision Maker

Add code
May 27, 2025
Viaarxiv icon

ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges

Add code
May 21, 2025
Figure 1 for ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges
Figure 2 for ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges
Figure 3 for ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges
Figure 4 for ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges
Viaarxiv icon

Graph Foundation Models: A Comprehensive Survey

Add code
May 21, 2025
Viaarxiv icon

RM-R1: Reward Modeling as Reasoning

Add code
May 05, 2025
Viaarxiv icon

OTC: Optimal Tool Calls via Reinforcement Learning

Add code
Apr 21, 2025
Figure 1 for OTC: Optimal Tool Calls via Reinforcement Learning
Figure 2 for OTC: Optimal Tool Calls via Reinforcement Learning
Figure 3 for OTC: Optimal Tool Calls via Reinforcement Learning
Figure 4 for OTC: Optimal Tool Calls via Reinforcement Learning
Viaarxiv icon

ToolRL: Reward is All Tool Learning Needs

Add code
Apr 16, 2025
Viaarxiv icon

SMART: Self-Aware Agent for Tool Overuse Mitigation

Add code
Feb 17, 2025
Figure 1 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Figure 2 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Figure 3 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Figure 4 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Viaarxiv icon