Yaodong Yang

Redefining Superalignment: From Weak-to-Strong Alignment to Human-AI Co-Alignment to Sustainable Symbiotic Society

Apr 24, 2025
Via arXiv

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Apr 22, 2025
Via arXiv

Benchmarking Multi-National Value Alignment for Large Language Models

Apr 19, 2025
Via arXiv

Dexterous Non-Prehensile Manipulation for Ungraspable Object via Extrinsic Dexterity

Mar 29, 2025
Via arXiv

ThinkPatterns-21k: A Systematic Study on the Impact of Thinking Patterns in LLMs

Mar 17, 2025
Via arXiv

SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Safe Reinforcement Learning

Mar 05, 2025
Via arXiv

Differentiable Information Enhanced Model-Based Reinforcement Learning

Mar 03, 2025
Via arXiv

DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping

Feb 28, 2025
Via arXiv

Retrieval Dexterity: Efficient Object Retrieval in Clutters with Dexterous Hand

Feb 26, 2025
Via arXiv

Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs

Feb 26, 2025
Via arXiv