Jiuzheng Wang

Teaching AI Through Benchmark Construction: QuestBench as a Course-Based Practice for Accountable Knowledge Work

May 21, 2026
Via arXiv

DeepWeb-Bench: A Deep Research Benchmark Demanding Massive Cross-Source Evidence and Long-Horizon Derivation

May 20, 2026
Via arXiv

Kimi K2.5: Visual Agentic Intelligence

Feb 02, 2026
Via arXiv

RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization

May 16, 2025
Via arXiv