Picture for Ashish Shah

Ashish Shah

Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval

Add code
May 01, 2024
Figure 1 for Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval
Figure 2 for Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval
Figure 3 for Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval
Figure 4 for Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval
Viaarxiv icon

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Add code
Apr 08, 2024
Figure 1 for MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
Figure 2 for MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
Figure 3 for MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
Figure 4 for MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
Viaarxiv icon

Universal Pyramid Adversarial Training for Improved ViT Performance

Add code
Dec 26, 2023
Figure 1 for Universal Pyramid Adversarial Training for Improved ViT Performance
Figure 2 for Universal Pyramid Adversarial Training for Improved ViT Performance
Figure 3 for Universal Pyramid Adversarial Training for Improved ViT Performance
Figure 4 for Universal Pyramid Adversarial Training for Improved ViT Performance
Viaarxiv icon

Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding

Add code
Sep 20, 2023
Figure 1 for Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding
Figure 2 for Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding
Figure 3 for Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding
Figure 4 for Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding
Viaarxiv icon

Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning

Add code
Dec 09, 2022
Figure 1 for Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning
Figure 2 for Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning
Figure 3 for Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning
Figure 4 for Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning
Viaarxiv icon

A Unified Model for Tracking and Image-Video Detection Has More Power

Add code
Nov 20, 2022
Figure 1 for A Unified Model for Tracking and Image-Video Detection Has More Power
Figure 2 for A Unified Model for Tracking and Image-Video Detection Has More Power
Figure 3 for A Unified Model for Tracking and Image-Video Detection Has More Power
Figure 4 for A Unified Model for Tracking and Image-Video Detection Has More Power
Viaarxiv icon

Raising the Bar on the Evaluation of Out-of-Distribution Detection

Add code
Sep 24, 2022
Figure 1 for Raising the Bar on the Evaluation of Out-of-Distribution Detection
Figure 2 for Raising the Bar on the Evaluation of Out-of-Distribution Detection
Figure 3 for Raising the Bar on the Evaluation of Out-of-Distribution Detection
Figure 4 for Raising the Bar on the Evaluation of Out-of-Distribution Detection
Viaarxiv icon

Object-Centric Unsupervised Image Captioning

Add code
Dec 02, 2021
Figure 1 for Object-Centric Unsupervised Image Captioning
Figure 2 for Object-Centric Unsupervised Image Captioning
Figure 3 for Object-Centric Unsupervised Image Captioning
Figure 4 for Object-Centric Unsupervised Image Captioning
Viaarxiv icon

MixNorm: Test-Time Adaptation Through Online Normalization Estimation

Add code
Oct 21, 2021
Figure 1 for MixNorm: Test-Time Adaptation Through Online Normalization Estimation
Figure 2 for MixNorm: Test-Time Adaptation Through Online Normalization Estimation
Figure 3 for MixNorm: Test-Time Adaptation Through Online Normalization Estimation
Figure 4 for MixNorm: Test-Time Adaptation Through Online Normalization Estimation
Viaarxiv icon

Self-appearance-aided Differential Evolution for Motion Transfer

Add code
Oct 09, 2021
Figure 1 for Self-appearance-aided Differential Evolution for Motion Transfer
Figure 2 for Self-appearance-aided Differential Evolution for Motion Transfer
Figure 3 for Self-appearance-aided Differential Evolution for Motion Transfer
Figure 4 for Self-appearance-aided Differential Evolution for Motion Transfer
Viaarxiv icon