Alert button
Picture for Muhammad Ferjad Naeem

Muhammad Ferjad Naeem

Alert button

How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs

Add code
Bookmark button
Alert button
May 08, 2024
Muhammad Uzair Khattak, Muhammad Ferjad Naeem, Jameel Hassan, Muzammal Naseer, Federico Tombari, Fahad Shahbaz Khan, Salman Khan

Figure 1 for How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Figure 2 for How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Figure 3 for How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Figure 4 for How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Viaarxiv icon

Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs

Add code
Bookmark button
Alert button
May 06, 2024
Muhammad Uzair Khattak, Muhammad Ferjad Naeem, Jameel Hassan, Muzammal Naseer, Federico Tombari, Fahad Shahbaz Khan, Salman Khan

Figure 1 for Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Figure 2 for Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Figure 3 for Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Figure 4 for Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Viaarxiv icon

GiT: Towards Generalist Vision Transformer through Universal Language Interface

Add code
Bookmark button
Alert button
Mar 14, 2024
Haiyang Wang, Hao Tang, Li Jiang, Shaoshuai Shi, Muhammad Ferjad Naeem, Hongsheng Li, Bernt Schiele, Liwei Wang

Figure 1 for GiT: Towards Generalist Vision Transformer through Universal Language Interface
Figure 2 for GiT: Towards Generalist Vision Transformer through Universal Language Interface
Figure 3 for GiT: Towards Generalist Vision Transformer through Universal Language Interface
Figure 4 for GiT: Towards Generalist Vision Transformer through Universal Language Interface
Viaarxiv icon

FocusCLIP: Multimodal Subject-Level Guidance for Zero-Shot Transfer in Human-Centric Tasks

Add code
Bookmark button
Alert button
Mar 11, 2024
Muhammad Saif Ullah Khan, Muhammad Ferjad Naeem, Federico Tombari, Luc Van Gool, Didier Stricker, Muhammad Zeshan Afzal

Figure 1 for FocusCLIP: Multimodal Subject-Level Guidance for Zero-Shot Transfer in Human-Centric Tasks
Figure 2 for FocusCLIP: Multimodal Subject-Level Guidance for Zero-Shot Transfer in Human-Centric Tasks
Figure 3 for FocusCLIP: Multimodal Subject-Level Guidance for Zero-Shot Transfer in Human-Centric Tasks
Figure 4 for FocusCLIP: Multimodal Subject-Level Guidance for Zero-Shot Transfer in Human-Centric Tasks
Viaarxiv icon

Learning to Prompt with Text Only Supervision for Vision-Language Models

Add code
Bookmark button
Alert button
Jan 04, 2024
Muhammad Uzair Khattak, Muhammad Ferjad Naeem, Muzammal Naseer, Luc Van Gool, Federico Tombari

Viaarxiv icon

SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance

Add code
Bookmark button
Alert button
Nov 27, 2023
Lukas Hoyer, David Joseph Tan, Muhammad Ferjad Naeem, Luc Van Gool, Federico Tombari

Figure 1 for SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance
Figure 2 for SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance
Figure 3 for SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance
Figure 4 for SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance
Viaarxiv icon

SILC: Improving Vision Language Pretraining with Self-Distillation

Add code
Bookmark button
Alert button
Oct 20, 2023
Muhammad Ferjad Naeem, Yongqin Xian, Xiaohua Zhai, Lukas Hoyer, Luc Van Gool, Federico Tombari

Figure 1 for SILC: Improving Vision Language Pretraining with Self-Distillation
Figure 2 for SILC: Improving Vision Language Pretraining with Self-Distillation
Figure 3 for SILC: Improving Vision Language Pretraining with Self-Distillation
Figure 4 for SILC: Improving Vision Language Pretraining with Self-Distillation
Viaarxiv icon

Introducing Language Guidance in Prompt-based Continual Learning

Add code
Bookmark button
Alert button
Aug 30, 2023
Muhammad Gul Zain Ali Khan, Muhammad Ferjad Naeem, Luc Van Gool, Didier Stricker, Federico Tombari, Muhammad Zeshan Afzal

Figure 1 for Introducing Language Guidance in Prompt-based Continual Learning
Figure 2 for Introducing Language Guidance in Prompt-based Continual Learning
Figure 3 for Introducing Language Guidance in Prompt-based Continual Learning
Figure 4 for Introducing Language Guidance in Prompt-based Continual Learning
Viaarxiv icon

I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification

Add code
Bookmark button
Alert button
Dec 05, 2022
Muhammad Ferjad Naeem, Muhammad Gul Zain Ali Khan, Yongqin Xian, Muhammad Zeshan Afzal, Didier Stricker, Luc Van Gool, Federico Tombari

Figure 1 for I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification
Figure 2 for I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification
Figure 3 for I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification
Figure 4 for I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification
Viaarxiv icon

I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification

Add code
Bookmark button
Alert button
Sep 21, 2022
Muhammad Ferjad Naeem, Yongqin Xian, Luc Van Gool, Federico Tombari

Figure 1 for I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification
Figure 2 for I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification
Figure 3 for I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification
Figure 4 for I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification
Viaarxiv icon