Picture for Haoquan Zhang

Haoquan Zhang

AI Idea Bench 2025: AI Research Idea Generation Benchmark

Add code
Apr 19, 2025
Viaarxiv icon

MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models

Add code
Apr 08, 2025
Viaarxiv icon