Desmond Elliott

SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation

Sep 30, 2022
Rita Ramos, Bruno Martins, Desmond Elliott, Yova Kementchedjhieva

Language Modelling with Pixels

Jul 14, 2022
Phillip Rust, Jonas F. Lotz, Emanuele Bugliarello, Elizabeth Salesky, Miryam de Lhoneux, Desmond Elliott

Revisiting Transformer-based Models for Long Document Classification

Apr 14, 2022
Xiang Dai, Ilias Chalkidis, Sune Darkner, Desmond Elliott

IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages

Jan 27, 2022
Emanuele Bugliarello, Fangyu Liu, Jonas Pfeiffer, Siva Reddy, Desmond Elliott, Edoardo Maria Ponti, Ivan Vulić

Visually Grounded Reasoning across Languages and Cultures

Sep 28, 2021
Fangyu Liu, Emanuele Bugliarello, Edoardo Maria Ponti, Siva Reddy, Nigel Collier, Desmond Elliott

MDAPT: Multilingual Domain Adaptive Pretraining in a Single Model

Sep 14, 2021
Rasmus Kær Jørgensen, Mareike Hartmann, Xiang Dai, Desmond Elliott

Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers

Sep 09, 2021
Stella Frank, Emanuele Bugliarello, Desmond Elliott

The Role of Syntactic Planning in Compositional Image Captioning

Jan 28, 2021
Emanuele Bugliarello, Desmond Elliott

Multimodal Pretraining Unmasked: Unifying the Vision and Language BERTs

Nov 30, 2020
Emanuele Bugliarello, Ryan Cotterell, Naoaki Okazaki, Desmond Elliott

Multimodal Speech Recognition with Unstructured Audio Masking

Oct 16, 2020
Tejas Srinivasan, Ramon Sanabria, Florian Metze, Desmond Elliott
