Yao Shu

Multinoulli Extension: A Lossless Continuous Relaxation for Partition-Constrained Subset Selection

Mar 23, 2026 (via arXiv)

Model-based Offline RL via Robust Value-Aware Model Learning with Implicitly Differentiable Adaptive Weighting

Mar 09, 2026 (via arXiv)

MASPOB: Bandit-Based Prompt Optimization for Multi-Agent Systems with Graph Neural Networks

Mar 03, 2026 (via arXiv)

ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation

Mar 03, 2026 (via arXiv)

Words & Weights: Streamlining Multi-Turn Interactions via Co-Adaptation

Mar 02, 2026 (via arXiv)

LFPO: Likelihood-Free Policy Optimization for Masked Diffusion Models

Mar 02, 2026 (via arXiv)

Training Multi-Turn Search Agent via Contrastive Dynamic Branch Sampling

Feb 03, 2026 (via arXiv)

1S-DAug: One-Shot Data Augmentation for Robust Few-Shot Generalization

Jan 27, 2026 (via arXiv)

Controllable Concept Bottleneck Models

Jan 01, 2026 (via arXiv)

Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives

Sep 26, 2025 (via arXiv)