Picture for Kanchana Ranasinghe

Kanchana Ranasinghe

Understanding Long Videos in One Multimodal Language Model Pass

Add code
Mar 25, 2024
Figure 1 for Understanding Long Videos in One Multimodal Language Model Pass
Figure 2 for Understanding Long Videos in One Multimodal Language Model Pass
Figure 3 for Understanding Long Videos in One Multimodal Language Model Pass
Figure 4 for Understanding Long Videos in One Multimodal Language Model Pass
Viaarxiv icon

Language Repository for Long Video Understanding

Add code
Mar 21, 2024
Figure 1 for Language Repository for Long Video Understanding
Figure 2 for Language Repository for Long Video Understanding
Figure 3 for Language Repository for Long Video Understanding
Figure 4 for Language Repository for Long Video Understanding
Viaarxiv icon

Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning

Add code
Mar 21, 2024
Viaarxiv icon

Diffusion Illusions: Hiding Images in Plain Sight

Add code
Dec 06, 2023
Figure 1 for Diffusion Illusions: Hiding Images in Plain Sight
Figure 2 for Diffusion Illusions: Hiding Images in Plain Sight
Figure 3 for Diffusion Illusions: Hiding Images in Plain Sight
Figure 4 for Diffusion Illusions: Hiding Images in Plain Sight
Viaarxiv icon

Language-based Action Concept Spaces Improve Video Self-Supervised Learning

Add code
Jul 20, 2023
Viaarxiv icon

Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors

Add code
Nov 23, 2022
Figure 1 for Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors
Figure 2 for Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors
Figure 3 for Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors
Figure 4 for Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors
Viaarxiv icon

Perceptual Grouping in Vision-Language Models

Add code
Oct 18, 2022
Figure 1 for Perceptual Grouping in Vision-Language Models
Figure 2 for Perceptual Grouping in Vision-Language Models
Figure 3 for Perceptual Grouping in Vision-Language Models
Figure 4 for Perceptual Grouping in Vision-Language Models
Viaarxiv icon

Self-supervised Video Transformer

Add code
Dec 02, 2021
Figure 1 for Self-supervised Video Transformer
Figure 2 for Self-supervised Video Transformer
Figure 3 for Self-supervised Video Transformer
Figure 4 for Self-supervised Video Transformer
Viaarxiv icon

Intriguing Properties of Vision Transformers

Add code
Jun 08, 2021
Figure 1 for Intriguing Properties of Vision Transformers
Figure 2 for Intriguing Properties of Vision Transformers
Figure 3 for Intriguing Properties of Vision Transformers
Figure 4 for Intriguing Properties of Vision Transformers
Viaarxiv icon

On Improving Adversarial Transferability of Vision Transformers

Add code
Jun 08, 2021
Figure 1 for On Improving Adversarial Transferability of Vision Transformers
Figure 2 for On Improving Adversarial Transferability of Vision Transformers
Figure 3 for On Improving Adversarial Transferability of Vision Transformers
Figure 4 for On Improving Adversarial Transferability of Vision Transformers
Viaarxiv icon