Picture for Huu-Thien Tran

Huu-Thien Tran

Directed-Tokens: A Robust Multi-Modality Alignment Approach to Large Language-Vision Models

Add code
Aug 19, 2025
Viaarxiv icon

BIMA: Bijective Maximum Likelihood Learning Approach to Hallucination Prediction and Mitigation in Large Vision-Language Models

Add code
May 30, 2025
Figure 1 for BIMA: Bijective Maximum Likelihood Learning Approach to Hallucination Prediction and Mitigation in Large Vision-Language Models
Figure 2 for BIMA: Bijective Maximum Likelihood Learning Approach to Hallucination Prediction and Mitigation in Large Vision-Language Models
Figure 3 for BIMA: Bijective Maximum Likelihood Learning Approach to Hallucination Prediction and Mitigation in Large Vision-Language Models
Figure 4 for BIMA: Bijective Maximum Likelihood Learning Approach to Hallucination Prediction and Mitigation in Large Vision-Language Models
Viaarxiv icon

MEX: Memory-efficient Approach to Referring Multi-Object Tracking

Add code
Feb 19, 2025
Viaarxiv icon