Picture for Imran Razzak

Imran Razzak

DocAtlas: Multilingual Document Understanding Across 80+ Languages

Add code
May 12, 2026
Viaarxiv icon

MAGE: Multi-Agent Self-Evolution with Co-Evolutionary Knowledge Graphs

Add code
May 11, 2026
Viaarxiv icon

See Fair, Speak Truth: Equitable Attention Improves Grounding and Reduces Hallucination in Vision-Language Alignment

Add code
Apr 10, 2026
Viaarxiv icon

CoME-VL: Scaling Complementary Multi-Encoder Vision-Language Learning

Add code
Apr 03, 2026
Viaarxiv icon

SAGE: Sustainable Agent-Guided Expert-tuning for Culturally Attuned Translation in Low-Resource Southeast Asia

Add code
Mar 20, 2026
Viaarxiv icon

TuLaBM: Tumor-Biased Latent Bridge Matching for Contrast-Enhanced MRI Synthesis

Add code
Mar 19, 2026
Viaarxiv icon

VGS-Decoding: Visual Grounding Score Guided Decoding for Hallucination Mitigation in Medical VLMs

Add code
Mar 19, 2026
Viaarxiv icon

DeLo: Dual Decomposed Low-Rank Experts Collaboration for Continual Missing Modality Learning

Add code
Mar 02, 2026
Viaarxiv icon

ClinCoT: Clinical-Aware Visual Chain-of-Thought for Medical Vision Language Models

Add code
Mar 01, 2026
Viaarxiv icon

CMSA-Net: Causal Multi-scale Aggregation with Adaptive Multi-source Reference for Video Polyp Segmentation

Add code
Feb 26, 2026
Viaarxiv icon