Picture for Fahad Khan

Fahad Khan

CNR-ILC

VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

Add code
Jun 13, 2024
Viaarxiv icon

On the Design of Human-Robot Collaboration Gestures

Add code
Feb 29, 2024
Figure 1 for On the Design of Human-Robot Collaboration Gestures
Viaarxiv icon

MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation

Add code
Feb 27, 2024
Figure 1 for MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation
Figure 2 for MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation
Figure 3 for MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation
Figure 4 for MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation
Viaarxiv icon

Distilling Local Texture Features for Colorectal Tissue Classification in Low Data Regimes

Add code
Jan 02, 2024
Viaarxiv icon

VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning

Add code
Nov 25, 2023
Figure 1 for VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning
Figure 2 for VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning
Figure 3 for VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning
Figure 4 for VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning
Viaarxiv icon

PG-Video-LLaVA: Pixel Grounding Large Video-Language Models

Add code
Nov 22, 2023
Figure 1 for PG-Video-LLaVA: Pixel Grounding Large Video-Language Models
Figure 2 for PG-Video-LLaVA: Pixel Grounding Large Video-Language Models
Figure 3 for PG-Video-LLaVA: Pixel Grounding Large Video-Language Models
Figure 4 for PG-Video-LLaVA: Pixel Grounding Large Video-Language Models
Viaarxiv icon

Sentence-level Prompts Benefit Composed Image Retrieval

Add code
Oct 09, 2023
Figure 1 for Sentence-level Prompts Benefit Composed Image Retrieval
Figure 2 for Sentence-level Prompts Benefit Composed Image Retrieval
Figure 3 for Sentence-level Prompts Benefit Composed Image Retrieval
Figure 4 for Sentence-level Prompts Benefit Composed Image Retrieval
Viaarxiv icon

3D Indoor Instance Segmentation in an Open-World

Add code
Sep 25, 2023
Viaarxiv icon

Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment

Add code
Aug 24, 2023
Figure 1 for Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment
Figure 2 for Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment
Figure 3 for Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment
Figure 4 for Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment
Viaarxiv icon

LEAPS: End-to-End One-Step Person Search With Learnable Proposals

Add code
Mar 21, 2023
Figure 1 for LEAPS: End-to-End One-Step Person Search With Learnable Proposals
Figure 2 for LEAPS: End-to-End One-Step Person Search With Learnable Proposals
Figure 3 for LEAPS: End-to-End One-Step Person Search With Learnable Proposals
Figure 4 for LEAPS: End-to-End One-Step Person Search With Learnable Proposals
Viaarxiv icon