Raoyuan Zhao

What's the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns

Apr 22, 2025
Via arXiv

SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists

Aug 30, 2024
Via arXiv