Picture for Minh Khoi Ho

Minh Khoi Ho

Localizing Before Answering: A Hallucination Evaluation Benchmark for Grounded Medical Multimodal LLMs

Add code
May 05, 2025
Viaarxiv icon

TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning

Add code
Apr 14, 2024
Viaarxiv icon