Picture for Qing-Guo Chen

Qing-Guo Chen

LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization

Add code
Jun 11, 2025
Viaarxiv icon

Multimodal Tabular Reasoning with Privileged Structured Information

Add code
Jun 04, 2025
Viaarxiv icon

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Add code
May 05, 2025
Viaarxiv icon

CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation

Add code
Feb 18, 2025
Viaarxiv icon

Multi-Label Test-Time Adaptation with Bound Entropy Minimization

Add code
Feb 06, 2025
Figure 1 for Multi-Label Test-Time Adaptation with Bound Entropy Minimization
Figure 2 for Multi-Label Test-Time Adaptation with Bound Entropy Minimization
Figure 3 for Multi-Label Test-Time Adaptation with Bound Entropy Minimization
Figure 4 for Multi-Label Test-Time Adaptation with Bound Entropy Minimization
Viaarxiv icon

Evaluating Image Caption via Cycle-consistent Text-to-Image Generation

Add code
Jan 08, 2025
Figure 1 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Figure 2 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Figure 3 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Figure 4 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Viaarxiv icon

MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs

Add code
Jan 06, 2025
Viaarxiv icon

UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation

Add code
Dec 25, 2024
Figure 1 for UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
Figure 2 for UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
Figure 3 for UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
Figure 4 for UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
Viaarxiv icon

OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions

Add code
Dec 09, 2024
Viaarxiv icon

PEMF-VVTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm

Add code
Dec 05, 2024
Figure 1 for PEMF-VVTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm
Figure 2 for PEMF-VVTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm
Figure 3 for PEMF-VVTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm
Figure 4 for PEMF-VVTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm
Viaarxiv icon