Alert button
Picture for Max Bain

Max Bain

Alert button

Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models

Add code
Bookmark button
Alert button
Apr 18, 2024
Aitor Ormazabal, Che Zheng, Cyprien de Masson d'Autume, Dani Yogatama, Deyu Fu, Donovan Ong, Eric Chen, Eugenie Lamprecht, Hai Pham, Isaac Ong, Kaloyan Aleksiev, Lei Li, Matthew Henderson, Max Bain, Mikel Artetxe, Nishant Relan, Piotr Padlewski, Qi Liu, Ren Chen, Samuel Phua, Yazheng Yang, Yi Tay, Yuqi Wang, Zhongkai Zhu, Zhihui Xie

Viaarxiv icon

AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description

Add code
Bookmark button
Alert button
Oct 10, 2023
Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman

Figure 1 for AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description
Figure 2 for AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description
Figure 3 for AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description
Figure 4 for AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description
Viaarxiv icon

OxfordVGG Submission to the EGO4D AV Transcription Challenge

Add code
Bookmark button
Alert button
Jul 18, 2023
Jaesung Huh, Max Bain, Andrew Zisserman

Figure 1 for OxfordVGG Submission to the EGO4D AV Transcription Challenge
Figure 2 for OxfordVGG Submission to the EGO4D AV Transcription Challenge
Viaarxiv icon

Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets

Add code
Bookmark button
Alert button
May 24, 2023
Brandon Smith, Miguel Farinha, Siobhan Mackenzie Hall, Hannah Rose Kirk, Aleksandar Shtedritski, Max Bain

Figure 1 for Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets
Figure 2 for Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets
Figure 3 for Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets
Figure 4 for Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets
Viaarxiv icon

AutoAD: Movie Description in Context

Add code
Bookmark button
Alert button
Mar 29, 2023
Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman

Figure 1 for AutoAD: Movie Description in Context
Figure 2 for AutoAD: Movie Description in Context
Figure 3 for AutoAD: Movie Description in Context
Figure 4 for AutoAD: Movie Description in Context
Viaarxiv icon

WhisperX: Time-Accurate Speech Transcription of Long-Form Audio

Add code
Bookmark button
Alert button
Mar 01, 2023
Max Bain, Jaesung Huh, Tengda Han, Andrew Zisserman

Figure 1 for WhisperX: Time-Accurate Speech Transcription of Long-Form Audio
Figure 2 for WhisperX: Time-Accurate Speech Transcription of Long-Form Audio
Figure 3 for WhisperX: Time-Accurate Speech Transcription of Long-Form Audio
Figure 4 for WhisperX: Time-Accurate Speech Transcription of Long-Form Audio
Viaarxiv icon

A CLIP-Hitchhiker's Guide to Long Video Retrieval

Add code
Bookmark button
Alert button
May 17, 2022
Max Bain, Arsha Nagrani, Gül Varol, Andrew Zisserman

Figure 1 for A CLIP-Hitchhiker's Guide to Long Video Retrieval
Figure 2 for A CLIP-Hitchhiker's Guide to Long Video Retrieval
Figure 3 for A CLIP-Hitchhiker's Guide to Long Video Retrieval
Figure 4 for A CLIP-Hitchhiker's Guide to Long Video Retrieval
Viaarxiv icon

A Prompt Array Keeps the Bias Away: Debiasing Vision-Language Models with Adversarial Learning

Add code
Bookmark button
Alert button
Apr 01, 2022
Hugo Berg, Siobhan Mackenzie Hall, Yash Bhalgat, Wonsuk Yang, Hannah Rose Kirk, Aleksandar Shtedritski, Max Bain

Figure 1 for A Prompt Array Keeps the Bias Away: Debiasing Vision-Language Models with Adversarial Learning
Figure 2 for A Prompt Array Keeps the Bias Away: Debiasing Vision-Language Models with Adversarial Learning
Figure 3 for A Prompt Array Keeps the Bias Away: Debiasing Vision-Language Models with Adversarial Learning
Figure 4 for A Prompt Array Keeps the Bias Away: Debiasing Vision-Language Models with Adversarial Learning
Viaarxiv icon

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval

Add code
Bookmark button
Alert button
Apr 01, 2021
Max Bain, Arsha Nagrani, Gül Varol, Andrew Zisserman

Figure 1 for Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
Figure 2 for Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
Figure 3 for Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
Figure 4 for Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
Viaarxiv icon

Condensed Movies: Story Based Retrieval with Contextual Embeddings

Add code
Bookmark button
Alert button
May 08, 2020
Max Bain, Arsha Nagrani, Andrew Brown, Andrew Zisserman

Figure 1 for Condensed Movies: Story Based Retrieval with Contextual Embeddings
Figure 2 for Condensed Movies: Story Based Retrieval with Contextual Embeddings
Figure 3 for Condensed Movies: Story Based Retrieval with Contextual Embeddings
Figure 4 for Condensed Movies: Story Based Retrieval with Contextual Embeddings
Viaarxiv icon