Alert button
Picture for Fahad Shahbaz Khan

Fahad Shahbaz Khan

Alert button

ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes

Add code
Bookmark button
Alert button
Mar 15, 2024
Hashmat Shadab Malik, Muhammad Huzaifa, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan

Figure 1 for ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes
Figure 2 for ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes
Figure 3 for ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes
Figure 4 for ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes
Viaarxiv icon

Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery

Add code
Bookmark button
Alert button
Mar 08, 2024
Mubashir Noman, Muzammal Naseer, Hisham Cholakkal, Rao Muhammad Anwar, Salman Khan, Fahad Shahbaz Khan

Figure 1 for Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery
Figure 2 for Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery
Figure 3 for Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery
Figure 4 for Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery
Viaarxiv icon

Effectiveness Assessment of Recent Large Vision-Language Models

Add code
Bookmark button
Alert button
Mar 07, 2024
Yao Jiang, Xinyu Yan, Ge-Peng Ji, Keren Fu, Meijun Sun, Huan Xiong, Deng-Ping Fan, Fahad Shahbaz Khan

Figure 1 for Effectiveness Assessment of Recent Large Vision-Language Models
Figure 2 for Effectiveness Assessment of Recent Large Vision-Language Models
Figure 3 for Effectiveness Assessment of Recent Large Vision-Language Models
Figure 4 for Effectiveness Assessment of Recent Large Vision-Language Models
Viaarxiv icon

MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT

Add code
Bookmark button
Alert button
Feb 26, 2024
Omkar Thawakar, Ashmal Vayani, Salman Khan, Hisham Cholakal, Rao M. Anwer, Michael Felsberg, Tim Baldwin, Eric P. Xing, Fahad Shahbaz Khan

Viaarxiv icon

Semi-supervised Open-World Object Detection

Add code
Bookmark button
Alert button
Feb 25, 2024
Sahal Shaji Mullappilly, Abhishek Singh Gehlot, Rao Muhammad Anwer, Fahad Shahbaz Khan, Hisham Cholakkal

Viaarxiv icon

BiMediX: Bilingual Medical Mixture of Experts LLM

Add code
Bookmark button
Alert button
Feb 20, 2024
Sara Pieri, Sahal Shaji Mullappilly, Fahad Shahbaz Khan, Rao Muhammad Anwer, Salman Khan, Timothy Baldwin, Hisham Cholakkal

Viaarxiv icon

Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models

Add code
Bookmark button
Alert button
Feb 08, 2024
Senmao Li, Joost van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang

Viaarxiv icon

Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding

Add code
Bookmark button
Alert button
Dec 31, 2023
Syed Talal Wasim, Muzammal Naseer, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan

Viaarxiv icon

Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models

Add code
Bookmark button
Alert button
Dec 15, 2023
Senmao Li, Taihang Hu, Fahad Shahbaz Khan, Linxuan Li, Shiqi Yang, Yaxing Wang, Ming-Ming Cheng, Jian Yang

Figure 1 for Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models
Figure 2 for Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models
Figure 3 for Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models
Figure 4 for Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models
Viaarxiv icon