Picture for Yaodong Yang

Yaodong Yang

SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Safe Reinforcement Learning

Add code
Mar 05, 2025
Figure 1 for SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Safe Reinforcement Learning
Figure 2 for SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Safe Reinforcement Learning
Figure 3 for SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Safe Reinforcement Learning
Figure 4 for SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Safe Reinforcement Learning
Viaarxiv icon

Differentiable Information Enhanced Model-Based Reinforcement Learning

Add code
Mar 03, 2025
Viaarxiv icon

DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping

Add code
Feb 28, 2025
Viaarxiv icon

Retrieval Dexterity: Efficient Object Retrieval in Clutters with Dexterous Hand

Add code
Feb 26, 2025
Viaarxiv icon

Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs

Add code
Feb 26, 2025
Figure 1 for Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs
Figure 2 for Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs
Figure 3 for Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs
Figure 4 for Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs
Viaarxiv icon

SAE-V: Interpreting Multimodal Models for Enhanced Alignment

Add code
Feb 22, 2025
Viaarxiv icon

Model Evolution Framework with Genetic Algorithm for Multi-Task Reinforcement Learning

Add code
Feb 19, 2025
Figure 1 for Model Evolution Framework with Genetic Algorithm for Multi-Task Reinforcement Learning
Figure 2 for Model Evolution Framework with Genetic Algorithm for Multi-Task Reinforcement Learning
Figure 3 for Model Evolution Framework with Genetic Algorithm for Multi-Task Reinforcement Learning
Figure 4 for Model Evolution Framework with Genetic Algorithm for Multi-Task Reinforcement Learning
Viaarxiv icon

Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer

Add code
Feb 04, 2025
Figure 1 for Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer
Figure 2 for Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer
Figure 3 for Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer
Figure 4 for Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer
Viaarxiv icon

RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems?

Add code
Jan 20, 2025
Viaarxiv icon

Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction

Add code
Jan 09, 2025
Viaarxiv icon