Shuo Yang

GenomeQA: Benchmarking General Large Language Models for Genome Sequence Understanding

Apr 07, 2026
Via arXiv

Asymmetric Actor-Critic for Multi-turn LLM Agents

Mar 31, 2026
Via arXiv

Beyond Where to Look: Trajectory-Guided Reinforcement Learning for Multimodal RLVR

Mar 27, 2026
Via arXiv

Bridging Perception and Reasoning: Token Reweighting for RLVR in Multimodal LLMs

Mar 26, 2026
Via arXiv

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Mar 23, 2026
Via arXiv

Attention in Space: Functional Roles of VLM Heads for Spatial Reasoning

Mar 21, 2026
Via arXiv

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Mar 20, 2026
Via arXiv

CRAFT: Aligning Diffusion Models with Fine-Tuning Is Easier Than You Think

Mar 19, 2026
Via arXiv

GroupGuard: A Framework for Modeling and Defending Collusive Attacks in Multi-Agent Systems

Mar 14, 2026
Via arXiv

ToolTree: Efficient LLM Agent Tool Planning via Dual-Feedback Monte Carlo Tree Search and Bidirectional Pruning

Mar 13, 2026
Via arXiv