Picture for Yao Shu

Yao Shu

Compander-Aligned Query Geometry for Quantized Zeroth-Order Optimization

Add code
May 11, 2026
Viaarxiv icon

Why Zeroth-Order Adaptation May Forget Less: A Randomized Shaping Theory

Add code
May 11, 2026
Viaarxiv icon

Reference-Sampled Boltzmann Projection for KL-Regularized RLVR: Target-Matched Weighted SFT, Finite One-Shot Gaps, and Policy Mirror Descent

Add code
May 04, 2026
Viaarxiv icon

Can We Change the Stroke Size for Easier Diffusion?

Add code
Mar 25, 2026
Viaarxiv icon

Multinoulli Extension: A Lossless Continuous Relaxation for Partition-Constrained Subset Selection

Add code
Mar 23, 2026
Viaarxiv icon

Model-based Offline RL via Robust Value-Aware Model Learning with Implicitly Differentiable Adaptive Weighting

Add code
Mar 09, 2026
Viaarxiv icon

ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation

Add code
Mar 03, 2026
Viaarxiv icon

MASPOB: Bandit-Based Prompt Optimization for Multi-Agent Systems with Graph Neural Networks

Add code
Mar 03, 2026
Viaarxiv icon

LFPO: Likelihood-Free Policy Optimization for Masked Diffusion Models

Add code
Mar 02, 2026
Viaarxiv icon

Words & Weights: Streamlining Multi-Turn Interactions via Co-Adaptation

Add code
Mar 02, 2026
Viaarxiv icon