Minh Khoi Ho

Overthinking Causes Hallucination: Tracing Confounder Propagation in Vision Language Models

Mar 08, 2026
Via arXiv

Localizing Before Answering: A Hallucination Evaluation Benchmark for Grounded Medical Multimodal LLMs

May 05, 2025
Via arXiv

TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning

Apr 14, 2024
Via arXiv