Picture for Xiaohan Wang

Xiaohan Wang

Sue

GMPilot: An Expert AI Agent For FDA cGMP Compliance

Add code
Mar 21, 2026
Viaarxiv icon

Fine-tuning MLLMs Without Forgetting Is Easier Than You Think

Add code
Mar 15, 2026
Viaarxiv icon

CDRRM: Contrast-Driven Rubric Generation for Reliable and Interpretable Reward Modeling

Add code
Mar 09, 2026
Viaarxiv icon

SAE as a Crystal Ball: Interpretable Features Predict Cross-domain Transferability of LLMs without Training

Add code
Mar 03, 2026
Viaarxiv icon

Tool Verification for Test-Time Reinforcement Learning

Add code
Mar 02, 2026
Viaarxiv icon

Contextual Rollout Bandits for Reinforcement Learning with Verifiable Rewards

Add code
Feb 09, 2026
Viaarxiv icon

Concise Geometric Description as a Bridge: Unleashing the Potential of LLM for Plane Geometry Problem Solving

Add code
Jan 29, 2026
Viaarxiv icon

Your Group-Relative Advantage Is Biased

Add code
Jan 13, 2026
Viaarxiv icon

RadDiff: Describing Differences in Radiology Image Sets with Natural Language

Add code
Jan 07, 2026
Viaarxiv icon

Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning

Add code
Dec 24, 2025
Figure 1 for Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning
Figure 2 for Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning
Figure 3 for Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning
Figure 4 for Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning
Viaarxiv icon