Imran Kabir

IKIWISI: An Interactive Visual Pattern Generator for Evaluating the Reliability of Vision-Language Models Without Ground Truth

May 28, 2025
Via arXiv

Logic-RAG: Augmenting Large Multimodal Models with Visual-Spatial Knowledge for Road Scene Understanding

Mar 16, 2025
Via arXiv

Identifying Crucial Objects in Blind and Low-Vision Individuals' Navigation

Aug 23, 2024
Via arXiv

A Dataset for Crucial Object Recognition in Blind and Low-Vision Individuals' Navigation

Jul 23, 2024
Via arXiv