Muzammal Naseer

Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models

Feb 03, 2025
Via arXiv

Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation

Jan 08, 2025
Via arXiv

Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models

Dec 24, 2024
Via arXiv

UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging Modalities

Dec 13, 2024
Via arXiv

AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment

Oct 02, 2024
Via arXiv

Y-CA-Net: A Convolutional Attention Based Network for Volumetric Medical Image Segmentation

Oct 01, 2024
Via arXiv

CDChat: A Large Multimodal Model for Remote Sensing Change Description

Sep 24, 2024
Via arXiv

Distillation-free Scaling of Large SSMs for Images and Videos

Sep 18, 2024
Via arXiv

PromptSmooth: Certifying Robustness of Medical Vision-Language Models via Prompt Learning

Aug 29, 2024
Via arXiv

STEREO: Towards Adversarially Robust Concept Erasing from Text-to-Image Generation Models

Aug 29, 2024
Via arXiv