Weili Guan

Mastering Negation: Boosting Grounding Models via Grouped Opposition-Based Learning

Mar 13, 2026

HATS: Hardness-Aware Trajectory Synthesis for GUI Agents

Mar 12, 2026

Do All Individual Layers Help? An Empirical Study of Task-Interfering Layers in Vision-Language Models

Feb 01, 2026

ConLA: Contrastive Latent Action Learning from Human Videos for Robotic Manipulation

Jan 31, 2026

CVeDRL: An Efficient Code Verifier via Difficulty-aware Reinforcement Learning

Jan 30, 2026

StructAlign: Structured Cross-Modal Alignment for Continual Text-to-Video Retrieval

Jan 28, 2026

IOTA: Corrective Knowledge-Guided Prompt Learning via Black-White Box Framework

Jan 28, 2026

PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records

Jan 14, 2026

From Bias to Balance: Exploring and Mitigating Spatial Bias in LVLMs

Sep 26, 2025

Dual Knowledge-Enhanced Two-Stage Reasoner for Multimodal Dialog Systems

Sep 09, 2025