Picture for Qing-Guo Chen

Qing-Guo Chen

Deep But Reliable: Advancing Multi-turn Reasoning for Thinking with Images

Add code
Dec 19, 2025
Viaarxiv icon

Omni-View: Unlocking How Generation Facilitates Understanding in Unified 3D Model based on Multiview images

Add code
Nov 10, 2025
Viaarxiv icon

LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization

Add code
Jun 11, 2025
Figure 1 for LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization
Figure 2 for LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization
Figure 3 for LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization
Figure 4 for LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization
Viaarxiv icon

Multimodal Tabular Reasoning with Privileged Structured Information

Add code
Jun 04, 2025
Viaarxiv icon

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Add code
May 05, 2025
Viaarxiv icon

CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation

Add code
Feb 18, 2025
Figure 1 for CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation
Figure 2 for CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation
Figure 3 for CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation
Figure 4 for CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation
Viaarxiv icon

Multi-Label Test-Time Adaptation with Bound Entropy Minimization

Add code
Feb 06, 2025
Figure 1 for Multi-Label Test-Time Adaptation with Bound Entropy Minimization
Figure 2 for Multi-Label Test-Time Adaptation with Bound Entropy Minimization
Figure 3 for Multi-Label Test-Time Adaptation with Bound Entropy Minimization
Figure 4 for Multi-Label Test-Time Adaptation with Bound Entropy Minimization
Viaarxiv icon

Evaluating Image Caption via Cycle-consistent Text-to-Image Generation

Add code
Jan 08, 2025
Figure 1 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Figure 2 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Figure 3 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Figure 4 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Viaarxiv icon

MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs

Add code
Jan 06, 2025
Viaarxiv icon

UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation

Add code
Dec 25, 2024
Figure 1 for UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
Figure 2 for UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
Figure 3 for UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
Figure 4 for UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
Viaarxiv icon