Picture for Raoyuan Zhao

Raoyuan Zhao

Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing

Add code
May 27, 2025
Viaarxiv icon

MAKIEval: A Multilingual Automatic WiKidata-based Framework for Cultural Awareness Evaluation for LLMs

Add code
May 27, 2025
Viaarxiv icon

What's the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns

Add code
Apr 22, 2025
Viaarxiv icon

SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists

Add code
Aug 30, 2024
Viaarxiv icon