Picture for Dongyi Ding

Dongyi Ding

Towards Faithful and Controllable Personalization via Critique-Post-Edit Reinforcement Learning

Add code
Oct 21, 2025
Viaarxiv icon

MiCoTA: Bridging the Learnability Gap with Intermediate CoT and Teacher Assistants

Add code
Jul 02, 2025
Viaarxiv icon

PersonaFeedback: A Large-scale Human-annotated Benchmark For Personalization

Add code
Jun 15, 2025
Viaarxiv icon