Picture for Mubarak Shah

Mubarak Shah

A Culturally-diverse Multilingual Multimodal Video Benchmark & Model

Add code
Jun 08, 2025
Viaarxiv icon

From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos

Add code
Jun 05, 2025
Viaarxiv icon

Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks

Add code
May 30, 2025
Viaarxiv icon

MGD$^3$: Mode-Guided Dataset Distillation using Diffusion Models

Add code
May 25, 2025
Viaarxiv icon

Multi-Party Conversational Agents: A Survey

Add code
May 24, 2025
Viaarxiv icon

SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding

Add code
May 22, 2025
Viaarxiv icon

HumaniBench: A Human-Centric Framework for Large Multimodal Models Evaluation

Add code
May 16, 2025
Viaarxiv icon

MAVOS-DD: Multilingual Audio-Video Open-Set Deepfake Detection Benchmark

Add code
May 16, 2025
Viaarxiv icon

Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models

Add code
Apr 25, 2025
Viaarxiv icon

On Transfer-based Universal Attacks in Pure Black-box Setting

Add code
Apr 11, 2025
Viaarxiv icon