Picture for Zexi Kuang

Zexi Kuang

SciVer: Evaluating Foundation Models for Multimodal Scientific Claim Verification

Add code
Jun 18, 2025
Viaarxiv icon

Can LLMs Generate High-Quality Test Cases for Algorithm Problems? TestCase-Eval: A Systematic Evaluation of Fault Coverage and Exposure

Add code
Jun 13, 2025
Viaarxiv icon