Yang Yu

Tsinghua University

MiniOpt: Reasoning to Model and Solve General Optimization Problems with Limited Resources

Jun 25, 2026

Offline Multi-agent Continual Cooperation via Skill Partition and Reuse

Jun 24, 2026

Be Your Own Teacher: Steering Protein Language Models via Unsupervised Reward Optimization

Jun 17, 2026

How Should World Models Be Evaluated? A Decision-Making-Centric Position

Jun 13, 2026

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Jun 12, 2026

Autonomous Aerial Manipulation via Contextual Contrastive Meta Reinforcement Learning

Jun 07, 2026

Cooperative Long Rope Skipping via Multi-Agent Reinforcement Learning

Jun 06, 2026

Continual Quadruped Robots Coordination via Semantic Skill Discovery

Jun 06, 2026

MedHorizon: Towards Long-context Medical Video Understanding in the Wild

May 07, 2026

Adversarial Imitation Learning with General Function Approximation: Theoretical Analysis and Practical Algorithms

May 03, 2026