Picture for Mubarak Shah

Mubarak Shah

Aero-World: Action-Conditioned Aerial Video Generation from Inertial Controls

Add code
May 19, 2026
Viaarxiv icon

Weakly-Supervised Spatiotemporal Anomaly Detection

Add code
May 13, 2026
Viaarxiv icon

Dystruct: Dynamically Structured Diffusion Language Model Decoding via Bayesian Inference

Add code
May 10, 2026
Viaarxiv icon

VidTAG: Temporally Aligned Video to GPS Geolocalization with Denoising Sequence Prediction at a Global Scale

Add code
Apr 14, 2026
Viaarxiv icon

ViLL-E: Video LLM Embeddings for Retrieval

Add code
Apr 13, 2026
Viaarxiv icon

Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs

Add code
Apr 10, 2026
Viaarxiv icon

Learnability-Guided Diffusion for Dataset Distillation

Add code
Apr 01, 2026
Viaarxiv icon

Enhancing Box and Block Test with Computer Vision for Post-Stroke Upper Extremity Motor Evaluation

Add code
Mar 31, 2026
Viaarxiv icon

Seeing to Ground: Visual Attention for Hallucination-Resilient MDLLMs

Add code
Mar 26, 2026
Viaarxiv icon

TIGeR: A Unified Framework for Time, Images and Geo-location Retrieval

Add code
Mar 25, 2026
Viaarxiv icon