Mandela Patrick

Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers

Jun 09, 2021
Via arXiv

Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models

Apr 15, 2021
Via arXiv

Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning

Mar 18, 2021
Via arXiv

Support-set bottlenecks for video-text representation learning

Oct 06, 2020
Via arXiv

Labelling unlabelled videos from scratch with multi-modal self-supervision

Jun 24, 2020
Via arXiv

Multi-modal Self-Supervision from Generalized Data Transformations

Mar 09, 2020
Via arXiv

Understanding Deep Networks via Extremal Perturbations and Smooth Masks

Oct 18, 2019
Via arXiv