Picture for Yaodong Yang

Yaodong Yang

SafeLawBench: Towards Safe Alignment of Large Language Models

Add code
Jun 07, 2025
Viaarxiv icon

SIV-Bench: A Video Benchmark for Social Interaction Understanding and Reasoning

Add code
Jun 05, 2025
Viaarxiv icon

InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback

Add code
May 29, 2025
Viaarxiv icon

Risk-aware Direct Preference Optimization under Nested Risk Measure

Add code
May 29, 2025
Viaarxiv icon

From Strangers to Assistants: Fast Desire Alignment for Embodied Agent-User Adaptation

Add code
May 28, 2025
Viaarxiv icon

The Mirage of Multimodality: Where Truth is Tested and Honesty Unravels

Add code
May 26, 2025
Viaarxiv icon

EuroCon: Benchmarking Parliament Deliberation for Political Consensus Finding

Add code
May 26, 2025
Viaarxiv icon

Generative RLHF-V: Learning Principles from Multi-modal Human Preference

Add code
May 24, 2025
Viaarxiv icon

Mitigating Deceptive Alignment via Self-Monitoring

Add code
May 24, 2025
Viaarxiv icon

Adaptive Visuo-Tactile Fusion with Predictive Force Attention for Dexterous Manipulation

Add code
May 20, 2025
Viaarxiv icon