Kaiwen Luo

Explaining and Breaking the Safety-Helpfulness Ceiling via Preference Dimensional Expansion

May 13, 2026
Via arXiv

HearSay Benchmark: Do Audio LLMs Leak What They Hear?

Jan 07, 2026
Via arXiv