Picture for Chuanhao Li

Chuanhao Li

Multi-Sourced Compositional Generalization in Visual Question Answering

Add code
May 29, 2025
Viaarxiv icon

SridBench: Benchmark of Scientific Research Illustration Drawing of Image Generation Model

Add code
May 28, 2025
Viaarxiv icon

Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition

Add code
May 23, 2025
Viaarxiv icon

IA-T2I: Internet-Augmented Text-to-Image Generation

Add code
May 21, 2025
Viaarxiv icon

MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models

Add code
Apr 08, 2025
Viaarxiv icon

ARMOR v0.1: Empowering Autoregressive Multimodal Understanding Model with Interleaved Multimodal Generation via Asymmetric Synergy

Add code
Mar 09, 2025
Viaarxiv icon

Consistency of Compositional Generalization across Multiple Levels

Add code
Dec 18, 2024
Figure 1 for Consistency of Compositional Generalization across Multiple Levels
Figure 2 for Consistency of Compositional Generalization across Multiple Levels
Figure 3 for Consistency of Compositional Generalization across Multiple Levels
Figure 4 for Consistency of Compositional Generalization across Multiple Levels
Viaarxiv icon

GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation

Add code
Dec 01, 2024
Viaarxiv icon

PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference

Add code
Oct 29, 2024
Viaarxiv icon

MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Add code
Aug 05, 2024
Figure 1 for MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Figure 2 for MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Figure 3 for MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Figure 4 for MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Viaarxiv icon