Picture for Salman Khan

Salman Khan

VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs

Add code
Jun 14, 2024
Figure 1 for VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs
Figure 2 for VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs
Figure 3 for VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs
Figure 4 for VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs
Viaarxiv icon

Towards Evaluating the Robustness of Visual State Space Models

Add code
Jun 13, 2024
Figure 1 for Towards Evaluating the Robustness of Visual State Space Models
Figure 2 for Towards Evaluating the Robustness of Visual State Space Models
Figure 3 for Towards Evaluating the Robustness of Visual State Space Models
Figure 4 for Towards Evaluating the Robustness of Visual State Space Models
Viaarxiv icon

VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

Add code
Jun 13, 2024
Figure 1 for VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Figure 2 for VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Figure 3 for VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Figure 4 for VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Viaarxiv icon

On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models

Add code
Jun 12, 2024
Figure 1 for On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models
Figure 2 for On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models
Figure 3 for On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models
Figure 4 for On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models
Viaarxiv icon

Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning

Add code
Jun 06, 2024
Viaarxiv icon

Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation

Add code
Jun 04, 2024
Figure 1 for Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
Figure 2 for Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
Figure 3 for Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
Figure 4 for Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
Viaarxiv icon

Dual Hyperspectral Mamba for Efficient Spectral Compressive Imaging

Add code
Jun 01, 2024
Figure 1 for Dual Hyperspectral Mamba for Efficient Spectral Compressive Imaging
Figure 2 for Dual Hyperspectral Mamba for Efficient Spectral Compressive Imaging
Figure 3 for Dual Hyperspectral Mamba for Efficient Spectral Compressive Imaging
Figure 4 for Dual Hyperspectral Mamba for Efficient Spectral Compressive Imaging
Viaarxiv icon

Multi-modal Generation via Cross-Modal In-Context Learning

Add code
May 28, 2024
Figure 1 for Multi-modal Generation via Cross-Modal In-Context Learning
Figure 2 for Multi-modal Generation via Cross-Modal In-Context Learning
Figure 3 for Multi-modal Generation via Cross-Modal In-Context Learning
Figure 4 for Multi-modal Generation via Cross-Modal In-Context Learning
Viaarxiv icon

Adapting Large Multimodal Models to Distribution Shifts: The Role of In-Context Learning

Add code
May 20, 2024
Viaarxiv icon

How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs

Add code
May 08, 2024
Figure 1 for How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Figure 2 for How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Figure 3 for How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Figure 4 for How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Viaarxiv icon