Xu Guo

Rethinking Multiple-Choice Questions for RLVR: Unlocking Potential via Distractor Design

Mar 13, 2026
Via arXiv

DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation

Feb 12, 2026
Via arXiv

AnomSeer: Reinforcing Multimodal LLMs to Reason for Time-Series Anomaly Detection

Feb 09, 2026
Via arXiv

Self-Verification Dilemma: Experience-Driven Suppression of Overused Checking in LLM Reasoning

Feb 03, 2026
Via arXiv

DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer

Jan 04, 2026
Via arXiv

DiRL: An Efficient Post-Training Framework for Diffusion Language Models

Dec 23, 2025
Via arXiv

Slim-SC: Thought Pruning for Efficient Scaling with Self-Consistency

Sep 17, 2025
Via arXiv

X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning

Aug 11, 2025
Via arXiv

IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards

Aug 06, 2025
Via arXiv

TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning

Jun 16, 2025
Via arXiv