Picture for Qiming Li

Qiming Li

CAST: Mitigating Object Hallucination in Large Vision-Language Models via Caption-Guided Visual Attention Steering

Add code
May 06, 2026
Viaarxiv icon

Stratagem: Learning Transferable Reasoning via Trajectory-Modulated Game Self-Play

Add code
Apr 20, 2026
Viaarxiv icon

Not All Tokens See Equally: Perception-Grounded Policy Optimization for Large Vision-Language Models

Add code
Apr 02, 2026
Viaarxiv icon

Causal Tracing of Object Representations in Large Vision Language Models: Mechanistic Interpretability and Hallucination Mitigation

Add code
Nov 19, 2025
Figure 1 for Causal Tracing of Object Representations in Large Vision Language Models: Mechanistic Interpretability and Hallucination Mitigation
Figure 2 for Causal Tracing of Object Representations in Large Vision Language Models: Mechanistic Interpretability and Hallucination Mitigation
Figure 3 for Causal Tracing of Object Representations in Large Vision Language Models: Mechanistic Interpretability and Hallucination Mitigation
Figure 4 for Causal Tracing of Object Representations in Large Vision Language Models: Mechanistic Interpretability and Hallucination Mitigation
Viaarxiv icon

GMFL-Net: A Global Multi-geometric Feature Learning Network for Repetitive Action Counting

Add code
Aug 31, 2024
Figure 1 for GMFL-Net: A Global Multi-geometric Feature Learning Network for Repetitive Action Counting
Figure 2 for GMFL-Net: A Global Multi-geometric Feature Learning Network for Repetitive Action Counting
Figure 3 for GMFL-Net: A Global Multi-geometric Feature Learning Network for Repetitive Action Counting
Figure 4 for GMFL-Net: A Global Multi-geometric Feature Learning Network for Repetitive Action Counting
Viaarxiv icon

Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models

Add code
Jun 30, 2024
Figure 1 for Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models
Figure 2 for Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models
Figure 3 for Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models
Figure 4 for Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models
Viaarxiv icon