Picture for Qing-Guo Chen

Qing-Guo Chen

Walk the Talk: Bridging the Reasoning-Action Gap for Thinking with Images via Multimodal Agentic Policy Optimization

Add code
Apr 08, 2026
Viaarxiv icon

Training-Free Image Editing with Visual Context Integration and Concept Alignment

Add code
Apr 06, 2026
Viaarxiv icon

M$^2$: Dual-Memory Augmentation for Long-Horizon Web Agents via Trajectory Summarization and Insight Retrieval

Add code
Feb 28, 2026
Viaarxiv icon

Building Autonomous GUI Navigation via Agentic-Q Estimation and Step-Wise Policy Optimization

Add code
Feb 14, 2026
Viaarxiv icon

Adaptive Debiasing Tsallis Entropy for Test-Time Adaptation

Add code
Feb 12, 2026
Viaarxiv icon

Triplets Better Than Pairs: Towards Stable and Effective Self-Play Fine-Tuning for LLMs

Add code
Jan 13, 2026
Viaarxiv icon

Deep But Reliable: Advancing Multi-turn Reasoning for Thinking with Images

Add code
Dec 19, 2025
Figure 1 for Deep But Reliable: Advancing Multi-turn Reasoning for Thinking with Images
Figure 2 for Deep But Reliable: Advancing Multi-turn Reasoning for Thinking with Images
Figure 3 for Deep But Reliable: Advancing Multi-turn Reasoning for Thinking with Images
Figure 4 for Deep But Reliable: Advancing Multi-turn Reasoning for Thinking with Images
Viaarxiv icon

Omni-View: Unlocking How Generation Facilitates Understanding in Unified 3D Model based on Multiview images

Add code
Nov 10, 2025
Viaarxiv icon

LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization

Add code
Jun 11, 2025
Figure 1 for LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization
Figure 2 for LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization
Figure 3 for LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization
Figure 4 for LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization
Viaarxiv icon

Multimodal Tabular Reasoning with Privileged Structured Information

Add code
Jun 04, 2025
Viaarxiv icon