Picture for Wangbo Zhao

Wangbo Zhao

MDK12-Bench: A Comprehensive Evaluation of Multimodal Large Language Models on Multidisciplinary Exams

Add code
Aug 09, 2025
Viaarxiv icon

REPA Works Until It Doesn't: Early-Stopped, Holistic Alignment Supercharges Diffusion Training

Add code
May 22, 2025
Viaarxiv icon

DD-Ranking: Rethinking the Evaluation of Dataset Distillation

Add code
May 19, 2025
Viaarxiv icon

Unsupervised Learning for Class Distribution Mismatch

Add code
May 11, 2025
Viaarxiv icon

DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation

Add code
Apr 09, 2025
Viaarxiv icon

MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models

Add code
Apr 08, 2025
Viaarxiv icon

Dynamic Vision Mamba

Add code
Apr 07, 2025
Figure 1 for Dynamic Vision Mamba
Figure 2 for Dynamic Vision Mamba
Figure 3 for Dynamic Vision Mamba
Figure 4 for Dynamic Vision Mamba
Viaarxiv icon

MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification

Add code
Mar 16, 2025
Figure 1 for MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification
Figure 2 for MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification
Figure 3 for MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification
Figure 4 for MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification
Viaarxiv icon

PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models

Add code
Mar 16, 2025
Viaarxiv icon

Recurrent Diffusion for Large-Scale Parameter Generation

Add code
Jan 20, 2025
Figure 1 for Recurrent Diffusion for Large-Scale Parameter Generation
Figure 2 for Recurrent Diffusion for Large-Scale Parameter Generation
Figure 3 for Recurrent Diffusion for Large-Scale Parameter Generation
Figure 4 for Recurrent Diffusion for Large-Scale Parameter Generation
Viaarxiv icon