Picture for Mingrui Wu

Mingrui Wu

Alibaba Group

PromptEnhancer: A Simple Approach to Enhance Text-to-Image Models via Chain-of-Thought Prompt Rewriting

Add code
Sep 04, 2025
Viaarxiv icon

MIHBench: Benchmarking and Mitigating Multi-Image Hallucinations in Multimodal Large Language Models

Add code
Aug 01, 2025
Viaarxiv icon

RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning

Add code
May 23, 2025
Viaarxiv icon

Param$Δ$ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost

Add code
Apr 23, 2025
Viaarxiv icon

Vision Calorimeter for Anti-neutron Reconstruction: A Baseline

Add code
Aug 20, 2024
Figure 1 for Vision Calorimeter for Anti-neutron Reconstruction: A Baseline
Figure 2 for Vision Calorimeter for Anti-neutron Reconstruction: A Baseline
Figure 3 for Vision Calorimeter for Anti-neutron Reconstruction: A Baseline
Figure 4 for Vision Calorimeter for Anti-neutron Reconstruction: A Baseline
Viaarxiv icon

TraDiffusion: Trajectory-Based Training-Free Image Generation

Add code
Aug 19, 2024
Figure 1 for TraDiffusion: Trajectory-Based Training-Free Image Generation
Figure 2 for TraDiffusion: Trajectory-Based Training-Free Image Generation
Figure 3 for TraDiffusion: Trajectory-Based Training-Free Image Generation
Figure 4 for TraDiffusion: Trajectory-Based Training-Free Image Generation
Viaarxiv icon

ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models

Add code
Jul 31, 2024
Viaarxiv icon

Evaluating and Analyzing Relationship Hallucinations in LVLMs

Add code
Jun 24, 2024
Figure 1 for Evaluating and Analyzing Relationship Hallucinations in LVLMs
Figure 2 for Evaluating and Analyzing Relationship Hallucinations in LVLMs
Figure 3 for Evaluating and Analyzing Relationship Hallucinations in LVLMs
Figure 4 for Evaluating and Analyzing Relationship Hallucinations in LVLMs
Viaarxiv icon

LLM-Augmented Retrieval: Enhancing Retrieval Models Through Language Models and Doc-Level Embedding

Add code
Apr 08, 2024
Viaarxiv icon

End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation

Add code
Apr 01, 2022
Figure 1 for End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation
Figure 2 for End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation
Figure 3 for End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation
Figure 4 for End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation
Viaarxiv icon