Shuo Yang

DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data

Apr 21, 2026

Well Begun is Half Done: Training-Free and Model-Agnostic Semantically Guaranteed User Representation Initialization for Multimodal Recommendation

Apr 16, 2026

GenomeQA: Benchmarking General Large Language Models for Genome Sequence Understanding

Apr 07, 2026

Asymmetric Actor-Critic for Multi-turn LLM Agents

Mar 31, 2026

Beyond Where to Look: Trajectory-Guided Reinforcement Learning for Multimodal RLVR

Mar 27, 2026

Bridging Perception and Reasoning: Token Reweighting for RLVR in Multimodal LLMs

Mar 26, 2026

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Mar 23, 2026

Attention in Space: Functional Roles of VLM Heads for Spatial Reasoning

Mar 21, 2026

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Mar 20, 2026

CRAFT: Aligning Diffusion Models with Fine-Tuning Is Easier Than You Think

Mar 19, 2026