Picture for Zhuozhi Xiong

Zhuozhi Xiong

The Missing Piece in Model Editing: A Deep Dive into the Hidden Damage Brought By Model Editing

Add code
Mar 12, 2024
Viaarxiv icon

Beyond the Obvious: Evaluating the Reasoning Ability In Real-life Scenarios of Language Models on Life Scapes Reasoning Benchmark~(LSR-Benchmark)

Add code
Jul 11, 2023
Viaarxiv icon

Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation

Add code
Jun 15, 2023
Viaarxiv icon

Domain Mastery Benchmark: An Ever-Updating Benchmark for Evaluating Holistic Domain Knowledge of Large Language Model--A Preliminary Release

Add code
Apr 23, 2023
Viaarxiv icon