Adrià Recasens

Massachusetts Institute of Technology

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023

TacticAI: an AI assistant for football tactics

Oct 17, 2023

Zorro: the masked multimodal transformer

Jan 23, 2023

TAP-Vid: A Benchmark for Tracking Any Point in a Video

Nov 07, 2022

Broaden Your Views for Self-Supervised Video Learning

Mar 30, 2021

Self-Supervised MultiModal Versatile Networks

Jun 29, 2020

Learning to Zoom: a Saliency-Based Sampling Layer for Neural Networks

Sep 10, 2018

Synthetically Trained Icon Proposals for Parsing and Summarizing Infographics

Jul 27, 2018

Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input

Apr 04, 2018