Picture for Mubarak Shah

Mubarak Shah

VidTAG: Temporally Aligned Video to GPS Geolocalization with Denoising Sequence Prediction at a Global Scale

Add code
Apr 14, 2026
Viaarxiv icon

ViLL-E: Video LLM Embeddings for Retrieval

Add code
Apr 13, 2026
Viaarxiv icon

Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs

Add code
Apr 10, 2026
Viaarxiv icon

Learnability-Guided Diffusion for Dataset Distillation

Add code
Apr 01, 2026
Viaarxiv icon

Enhancing Box and Block Test with Computer Vision for Post-Stroke Upper Extremity Motor Evaluation

Add code
Mar 31, 2026
Viaarxiv icon

Seeing to Ground: Visual Attention for Hallucination-Resilient MDLLMs

Add code
Mar 26, 2026
Viaarxiv icon

TIGeR: A Unified Framework for Time, Images and Geo-location Retrieval

Add code
Mar 25, 2026
Viaarxiv icon

Curriculum-DPO++: Direct Preference Optimization via Data and Model Curricula for Text-to-Image Generation

Add code
Feb 13, 2026
Viaarxiv icon

Learning to Share: Selective Memory for Efficient Parallel Agentic Systems

Add code
Feb 05, 2026
Viaarxiv icon

CoRe: Context-Robust Remasking for Diffusion Language Models

Add code
Feb 04, 2026
Viaarxiv icon