Mingyang Wu

CV-Arena: An Open Benchmark for Instructional Computer Vision Problem Solving with Human-AI Collaborative Preferences

May 30, 2026

4KLSDB: A Large-Scale Dataset for 4K Image Restoration and Generation

May 23, 2026

The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics

Mar 15, 2026

MindPilot: Closed-loop Visual Stimulation Optimization for Brain Modulation with EEG-guided Diffusion

Feb 11, 2026

ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation

Feb 10, 2026

ChineseEEG-2: An EEG Dataset for Multimodal Semantic Alignment and Neural Decoding during Reading and Listening

Aug 06, 2025

MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding

Jul 16, 2025

4KAgent: Agentic Any Image to 4K Super-Resolution

Jul 09, 2025

Preserving AUC Fairness in Learning with Noisy Protected Groups

May 24, 2025

VISTA: Generative Visual Imagination for Vision-and-Language Navigation

May 17, 2025