
Xiusi Chen

Department of Computer Science, University of California, Los Angeles

Perception-Aware Policy Optimization for Multimodal Reasoning

Jul 08, 2025

DecisionFlow: Advancing Large Language Model as Principled Decision Maker

May 27, 2025

ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges

May 21, 2025

Graph Foundation Models: A Comprehensive Survey

May 21, 2025

RM-R1: Reward Modeling as Reasoning

May 05, 2025

OTC: Optimal Tool Calls via Reinforcement Learning

Apr 21, 2025

ToolRL: Reward is All Tool Learning Needs

Apr 16, 2025

SMART: Self-Aware Agent for Tool Overuse Mitigation

Feb 17, 2025

Beyond Sample-Level Feedback: Using Reference-Level Feedback to Guide Data Synthesis

Feb 06, 2025

Internal Activation as the Polar Star for Steering Unsafe LLM Behavior

Feb 04, 2025