Xuehai Wang

QuarkMedBench: A Real-World Scenario Driven Benchmark for Evaluating Large Language Models

Mar 14, 2026
Via arXiv

InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem

Feb 16, 2026
Via arXiv