Picture for Yu Yang

Yu Yang

Celine

SPIRAL: A Closed-Loop Framework for Self-Improving Action World Models via Reflective Planning Agents

Add code
Mar 11, 2026
Viaarxiv icon

Localized Dynamics-Aware Domain Adaption for Off-Dynamics Offline Reinforcement Learning

Add code
Feb 24, 2026
Viaarxiv icon

Mobility-Aware Cache Framework for Scalable LLM-Based Human Mobility Simulation

Add code
Feb 17, 2026
Viaarxiv icon

Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution

Add code
Feb 13, 2026
Viaarxiv icon

Efficient and Stable Reinforcement Learning for Diffusion Language Models

Add code
Feb 09, 2026
Viaarxiv icon

ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training

Add code
Feb 06, 2026
Viaarxiv icon

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Add code
Feb 03, 2026
Viaarxiv icon

MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning

Add code
Jan 29, 2026
Viaarxiv icon

FedRD: Reducing Divergences for Generalized Federated Learning via Heterogeneity-aware Parameter Guidance

Add code
Jan 28, 2026
Viaarxiv icon

FedCCA: Client-Centric Adaptation against Data Heterogeneity in Federated Learning on IoT Devices

Add code
Jan 25, 2026
Viaarxiv icon