Shuo Yang

StaminaBench: Stress-Testing Coding Agents over 100 Interaction Turns

Jun 17, 2026
Via arXiv

LLMZero: Discovering Adaptive Training Strategies for RL Post-Training via LLM Agents

Jun 16, 2026
Via arXiv

MotionWAM: Towards Foundation World Action Models for Real-Time Humanoid Loco-Manipulation

Jun 08, 2026
Via arXiv

Perceptive Behavior Foundation Model: Adapting Human Motion Priors to Robot-Centric Terrain

Jun 06, 2026
Via arXiv

DLLG: Dynamic Logit-Level Gating of LLM Experts

Jun 03, 2026
Via arXiv

Consolidating Rewarded Perturbations for LLM Post-Training

May 29, 2026
Via arXiv

LATTE: Forecasting Peer Anchored Preference Trajectories for Personalized LLM Generation

May 26, 2026
Via arXiv

TapSampling: Inference-Time Sampling with a Task-Progress-Understanding Verifier for Robotic Manipulation

May 25, 2026
Via arXiv

Beyond the Target: From Imitation to Collaboration in Speculative Decoding

May 24, 2026
Via arXiv

One-Way Policy Optimization for Self-Evolving LLMs

May 21, 2026
Via arXiv