Alert button
Picture for Lucas Smaira

Lucas Smaira

Alert button

DeepMind

Perception Test: A Diagnostic Benchmark for Multimodal Video Models

May 23, 2023
Viorica Pătrăucean, Lucas Smaira, Ankush Gupta, Adrià Recasens Continente, Larisa Markeeva, Dylan Banarse, Skanda Koppula, Joseph Heyward, Mateusz Malinowski, Yi Yang, Carl Doersch, Tatiana Matejovicova, Yury Sulsky, Antoine Miech, Alex Frechette, Hanna Klimczak, Raphael Koster, Junlin Zhang, Stephanie Winkler, Yusuf Aytar, Simon Osindero, Dima Damen, Andrew Zisserman, João Carreira

Figure 1 for Perception Test: A Diagnostic Benchmark for Multimodal Video Models
Figure 2 for Perception Test: A Diagnostic Benchmark for Multimodal Video Models
Figure 3 for Perception Test: A Diagnostic Benchmark for Multimodal Video Models
Figure 4 for Perception Test: A Diagnostic Benchmark for Multimodal Video Models
Viaarxiv icon

Zorro: the masked multimodal transformer

Jan 23, 2023
Adrià Recasens, Jason Lin, Joāo Carreira, Drew Jaegle, Luyu Wang, Jean-baptiste Alayrac, Pauline Luc, Antoine Miech, Lucas Smaira, Ross Hemsley, Andrew Zisserman

Figure 1 for Zorro: the masked multimodal transformer
Figure 2 for Zorro: the masked multimodal transformer
Figure 3 for Zorro: the masked multimodal transformer
Figure 4 for Zorro: the masked multimodal transformer
Viaarxiv icon

TAP-Vid: A Benchmark for Tracking Any Point in a Video

Nov 07, 2022
Carl Doersch, Ankush Gupta, Larisa Markeeva, Adrià Recasens, Lucas Smaira, Yusuf Aytar, João Carreira, Andrew Zisserman, Yi Yang

Figure 1 for TAP-Vid: A Benchmark for Tracking Any Point in a Video
Figure 2 for TAP-Vid: A Benchmark for Tracking Any Point in a Video
Figure 3 for TAP-Vid: A Benchmark for Tracking Any Point in a Video
Figure 4 for TAP-Vid: A Benchmark for Tracking Any Point in a Video
Viaarxiv icon

Towards Learning Universal Audio Representations

Dec 01, 2021
Luyu Wang, Pauline Luc, Yan Wu, Adria Recasens, Lucas Smaira, Andrew Brock, Andrew Jaegle, Jean-Baptiste Alayrac, Sander Dieleman, Joao Carreira, Aaron van den Oord

Figure 1 for Towards Learning Universal Audio Representations
Figure 2 for Towards Learning Universal Audio Representations
Figure 3 for Towards Learning Universal Audio Representations
Figure 4 for Towards Learning Universal Audio Representations
Viaarxiv icon

Human-Agent Cooperation in Bridge Bidding

Nov 28, 2020
Edward Lockhart, Neil Burch, Nolan Bard, Sebastian Borgeaud, Tom Eccles, Lucas Smaira, Ray Smith

Figure 1 for Human-Agent Cooperation in Bridge Bidding
Viaarxiv icon

A Short Note on the Kinetics-700-2020 Human Action Dataset

Oct 21, 2020
Lucas Smaira, João Carreira, Eric Noland, Ellen Clancy, Amy Wu, Andrew Zisserman

Figure 1 for A Short Note on the Kinetics-700-2020 Human Action Dataset
Figure 2 for A Short Note on the Kinetics-700-2020 Human Action Dataset
Figure 3 for A Short Note on the Kinetics-700-2020 Human Action Dataset
Figure 4 for A Short Note on the Kinetics-700-2020 Human Action Dataset
Viaarxiv icon

Self-Supervised MultiModal Versatile Networks

Jun 29, 2020
Jean-Baptiste Alayrac, Adrià Recasens, Rosalia Schneider, Relja Arandjelović, Jason Ramapuram, Jeffrey De Fauw, Lucas Smaira, Sander Dieleman, Andrew Zisserman

Figure 1 for Self-Supervised MultiModal Versatile Networks
Figure 2 for Self-Supervised MultiModal Versatile Networks
Figure 3 for Self-Supervised MultiModal Versatile Networks
Figure 4 for Self-Supervised MultiModal Versatile Networks
Viaarxiv icon

Visual Grounding in Video for Unsupervised Word Translation

Mar 26, 2020
Gunnar A. Sigurdsson, Jean-Baptiste Alayrac, Aida Nematzadeh, Lucas Smaira, Mateusz Malinowski, João Carreira, Phil Blunsom, Andrew Zisserman

Figure 1 for Visual Grounding in Video for Unsupervised Word Translation
Figure 2 for Visual Grounding in Video for Unsupervised Word Translation
Figure 3 for Visual Grounding in Video for Unsupervised Word Translation
Figure 4 for Visual Grounding in Video for Unsupervised Word Translation
Viaarxiv icon