Picture for Yaodong Yang

Yaodong Yang

CABTO: Context-Aware Behavior Tree Grounding for Robot Manipulation

Add code
Mar 17, 2026
Viaarxiv icon

Does LLM Alignment Really Need Diversity? An Empirical Study of Adapting RLVR Methods for Moral Reasoning

Add code
Mar 11, 2026
Viaarxiv icon

VISA: Value Injection via Shielded Adaptation for Personalized LLM Alignment

Add code
Mar 05, 2026
Viaarxiv icon

Heterogeneous Agent Collaborative Reinforcement Learning

Add code
Mar 03, 2026
Viaarxiv icon

MVR: Multi-view Video Reward Shaping for Reinforcement Learning

Add code
Mar 02, 2026
Viaarxiv icon

RMBench: Memory-Dependent Robotic Manipulation Benchmark with Insights into Policy Design

Add code
Mar 01, 2026
Viaarxiv icon

Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment

Add code
Feb 18, 2026
Viaarxiv icon

FormalJudge: A Neuro-Symbolic Paradigm for Agentic Oversight

Add code
Feb 12, 2026
Viaarxiv icon

ECO: Energy-Constrained Optimization with Reinforcement Learning for Humanoid Walking

Add code
Feb 06, 2026
Viaarxiv icon

Enhance the Safety in Reinforcement Learning by ADRC Lagrangian Methods

Add code
Jan 26, 2026
Viaarxiv icon