Picture for Ji-Lun Peng

Ji-Lun Peng

Rethinking Role-Playing Evaluation: Anonymous Benchmarking and a Systematic Study of Personality Effects

Add code
Mar 04, 2026
Viaarxiv icon

A Survey of Useful LLM Evaluation

Add code
Jun 03, 2024
Viaarxiv icon