Chongyang Pan

Teaching AI Through Benchmark Construction: QuestBench as a Course-Based Practice for Accountable Knowledge Work

May 21, 2026, via arXiv

MindLoom: Composing Thought Modes for Frontier-Level Reasoning Data Synthesis

May 20, 2026, via arXiv

DeepWeb-Bench: A Deep Research Benchmark Demanding Massive Cross-Source Evidence and Long-Horizon Derivation

May 20, 2026, via arXiv