Picture for Andrew Zisserman

Andrew Zisserman

DeepMind

Text-Conditioned Resampler For Long Form Video Understanding

Add code
Dec 19, 2023
Figure 1 for Text-Conditioned Resampler For Long Form Video Understanding
Figure 2 for Text-Conditioned Resampler For Long Form Video Understanding
Figure 3 for Text-Conditioned Resampler For Long Form Video Understanding
Figure 4 for Text-Conditioned Resampler For Long Form Video Understanding
Viaarxiv icon

Appearance-based Refinement for Object-Centric Motion Segmentation

Add code
Dec 18, 2023
Figure 1 for Appearance-based Refinement for Object-Centric Motion Segmentation
Figure 2 for Appearance-based Refinement for Object-Centric Motion Segmentation
Figure 3 for Appearance-based Refinement for Object-Centric Motion Segmentation
Figure 4 for Appearance-based Refinement for Object-Centric Motion Segmentation
Viaarxiv icon

A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames

Add code
Dec 12, 2023
Figure 1 for A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames
Figure 2 for A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames
Figure 3 for A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames
Figure 4 for A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames
Viaarxiv icon

Learning from One Continuous Video Stream

Add code
Dec 01, 2023
Figure 1 for Learning from One Continuous Video Stream
Figure 2 for Learning from One Continuous Video Stream
Figure 3 for Learning from One Continuous Video Stream
Figure 4 for Learning from One Continuous Video Stream
Viaarxiv icon

No Representation Rules Them All in Category Discovery

Add code
Nov 28, 2023
Viaarxiv icon

Predicting Spine Geometry and Scoliosis from DXA Scans

Add code
Nov 15, 2023
Figure 1 for Predicting Spine Geometry and Scoliosis from DXA Scans
Figure 2 for Predicting Spine Geometry and Scoliosis from DXA Scans
Figure 3 for Predicting Spine Geometry and Scoliosis from DXA Scans
Figure 4 for Predicting Spine Geometry and Scoliosis from DXA Scans
Viaarxiv icon

Show from Tell: Audio-Visual Modelling in Clinical Settings

Add code
Oct 25, 2023
Figure 1 for Show from Tell: Audio-Visual Modelling in Clinical Settings
Figure 2 for Show from Tell: Audio-Visual Modelling in Clinical Settings
Figure 3 for Show from Tell: Audio-Visual Modelling in Clinical Settings
Figure 4 for Show from Tell: Audio-Visual Modelling in Clinical Settings
Viaarxiv icon

What Does Stable Diffusion Know about the 3D Scene?

Add code
Oct 10, 2023
Figure 1 for What Does Stable Diffusion Know about the 3D Scene?
Figure 2 for What Does Stable Diffusion Know about the 3D Scene?
Figure 3 for What Does Stable Diffusion Know about the 3D Scene?
Figure 4 for What Does Stable Diffusion Know about the 3D Scene?
Viaarxiv icon

AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description

Add code
Oct 10, 2023
Figure 1 for AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description
Figure 2 for AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description
Figure 3 for AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description
Figure 4 for AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description
Viaarxiv icon

GestSync: Determining who is speaking without a talking head

Add code
Oct 08, 2023
Viaarxiv icon