Picture for Kaipeng Zhang

Kaipeng Zhang

EvoMoE: Expert Evolution in Mixture of Experts for Multimodal Large Language Models

Add code
May 28, 2025
Viaarxiv icon

SridBench: Benchmark of Scientific Research Illustration Drawing of Image Generation Model

Add code
May 28, 2025
Viaarxiv icon

REPA Works Until It Doesn't: Early-Stopped, Holistic Alignment Supercharges Diffusion Training

Add code
May 22, 2025
Viaarxiv icon

IA-T2I: Internet-Augmented Text-to-Image Generation

Add code
May 21, 2025
Viaarxiv icon

DD-Ranking: Rethinking the Evaluation of Dataset Distillation

Add code
May 19, 2025
Viaarxiv icon

Human-Aligned Bench: Fine-Grained Assessment of Reasoning Ability in MLLMs vs. Humans

Add code
May 16, 2025
Viaarxiv icon

AI Idea Bench 2025: AI Research Idea Generation Benchmark

Add code
Apr 19, 2025
Viaarxiv icon

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Add code
Apr 15, 2025
Viaarxiv icon

MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models

Add code
Apr 08, 2025
Viaarxiv icon

LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis

Add code
Mar 27, 2025
Viaarxiv icon