Picture for Andrea Vedaldi

Andrea Vedaldi

Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning

Add code
Mar 18, 2021
Figure 1 for Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning
Figure 2 for Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning
Figure 3 for Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning
Figure 4 for Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning
Viaarxiv icon

Continuous Surface Embeddings

Add code
Nov 24, 2020
Figure 1 for Continuous Surface Embeddings
Figure 2 for Continuous Surface Embeddings
Figure 3 for Continuous Surface Embeddings
Figure 4 for Continuous Surface Embeddings
Viaarxiv icon

3D Multi-bodies: Fitting Sets of Plausible 3D Human Models to Ambiguous Image Data

Add code
Nov 02, 2020
Figure 1 for 3D Multi-bodies: Fitting Sets of Plausible 3D Human Models to Ambiguous Image Data
Figure 2 for 3D Multi-bodies: Fitting Sets of Plausible 3D Human Models to Ambiguous Image Data
Figure 3 for 3D Multi-bodies: Fitting Sets of Plausible 3D Human Models to Ambiguous Image Data
Figure 4 for 3D Multi-bodies: Fitting Sets of Plausible 3D Human Models to Ambiguous Image Data
Viaarxiv icon

Quantifying Learnability and Describability of Visual Concepts Emerging in Representation Learning

Add code
Oct 27, 2020
Figure 1 for Quantifying Learnability and Describability of Visual Concepts Emerging in Representation Learning
Figure 2 for Quantifying Learnability and Describability of Visual Concepts Emerging in Representation Learning
Figure 3 for Quantifying Learnability and Describability of Visual Concepts Emerging in Representation Learning
Figure 4 for Quantifying Learnability and Describability of Visual Concepts Emerging in Representation Learning
Viaarxiv icon

Support-set bottlenecks for video-text representation learning

Add code
Oct 06, 2020
Figure 1 for Support-set bottlenecks for video-text representation learning
Figure 2 for Support-set bottlenecks for video-text representation learning
Figure 3 for Support-set bottlenecks for video-text representation learning
Figure 4 for Support-set bottlenecks for video-text representation learning
Viaarxiv icon

Calibrating Self-supervised Monocular Depth Estimation

Add code
Sep 16, 2020
Figure 1 for Calibrating Self-supervised Monocular Depth Estimation
Figure 2 for Calibrating Self-supervised Monocular Depth Estimation
Figure 3 for Calibrating Self-supervised Monocular Depth Estimation
Figure 4 for Calibrating Self-supervised Monocular Depth Estimation
Viaarxiv icon

Canonical 3D Deformer Maps: Unifying parametric and non-parametric methods for dense weakly-supervised category reconstruction

Add code
Aug 28, 2020
Figure 1 for Canonical 3D Deformer Maps: Unifying parametric and non-parametric methods for dense weakly-supervised category reconstruction
Figure 2 for Canonical 3D Deformer Maps: Unifying parametric and non-parametric methods for dense weakly-supervised category reconstruction
Figure 3 for Canonical 3D Deformer Maps: Unifying parametric and non-parametric methods for dense weakly-supervised category reconstruction
Figure 4 for Canonical 3D Deformer Maps: Unifying parametric and non-parametric methods for dense weakly-supervised category reconstruction
Viaarxiv icon

Automatic Recall Machines: Internal Replay, Continual Learning and the Brain

Add code
Jul 15, 2020
Figure 1 for Automatic Recall Machines: Internal Replay, Continual Learning and the Brain
Figure 2 for Automatic Recall Machines: Internal Replay, Continual Learning and the Brain
Figure 3 for Automatic Recall Machines: Internal Replay, Continual Learning and the Brain
Figure 4 for Automatic Recall Machines: Internal Replay, Continual Learning and the Brain
Viaarxiv icon

RELATE: Physically Plausible Multi-Object Scene Synthesis Using Structured Latent Spaces

Add code
Jul 02, 2020
Figure 1 for RELATE: Physically Plausible Multi-Object Scene Synthesis Using Structured Latent Spaces
Figure 2 for RELATE: Physically Plausible Multi-Object Scene Synthesis Using Structured Latent Spaces
Figure 3 for RELATE: Physically Plausible Multi-Object Scene Synthesis Using Structured Latent Spaces
Figure 4 for RELATE: Physically Plausible Multi-Object Scene Synthesis Using Structured Latent Spaces
Viaarxiv icon

Labelling unlabelled videos from scratch with multi-modal self-supervision

Add code
Jun 24, 2020
Figure 1 for Labelling unlabelled videos from scratch with multi-modal self-supervision
Figure 2 for Labelling unlabelled videos from scratch with multi-modal self-supervision
Figure 3 for Labelling unlabelled videos from scratch with multi-modal self-supervision
Figure 4 for Labelling unlabelled videos from scratch with multi-modal self-supervision
Viaarxiv icon