Picture for Huiyu Duan

Huiyu Duan

Enhancing Image Quality Assessment Ability of LMMs via Retrieval-Augmented Generation

Add code
Jan 13, 2026
Viaarxiv icon

Agentic Retoucher for Text-To-Image Generation

Add code
Jan 08, 2026
Viaarxiv icon

Robust Mesh Saliency GT Acquisition in VR via View Cone Sampling and Geometric Smoothing

Add code
Jan 06, 2026
Viaarxiv icon

VTONQA: A Multi-Dimensional Quality Assessment Dataset for Virtual Try-on

Add code
Jan 06, 2026
Viaarxiv icon

Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs

Add code
Dec 19, 2025
Figure 1 for Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs
Figure 2 for Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs
Figure 3 for Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs
Figure 4 for Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs
Viaarxiv icon

ManipShield: A Unified Framework for Image Manipulation Detection, Localization and Explanation

Add code
Nov 18, 2025
Figure 1 for ManipShield: A Unified Framework for Image Manipulation Detection, Localization and Explanation
Figure 2 for ManipShield: A Unified Framework for Image Manipulation Detection, Localization and Explanation
Figure 3 for ManipShield: A Unified Framework for Image Manipulation Detection, Localization and Explanation
Figure 4 for ManipShield: A Unified Framework for Image Manipulation Detection, Localization and Explanation
Viaarxiv icon

GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models

Add code
Nov 17, 2025
Viaarxiv icon

TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs

Add code
May 26, 2025
Viaarxiv icon

NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment

Add code
May 22, 2025
Viaarxiv icon

LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation

Add code
May 17, 2025
Viaarxiv icon