Picture for Aishwarya Kamath

Aishwarya Kamath

Capabilities of Gemini Models in Medicine

Add code
May 01, 2024
Figure 1 for Capabilities of Gemini Models in Medicine
Figure 2 for Capabilities of Gemini Models in Medicine
Figure 3 for Capabilities of Gemini Models in Medicine
Figure 4 for Capabilities of Gemini Models in Medicine
Viaarxiv icon

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning

Add code
Oct 06, 2022
Figure 1 for A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Figure 2 for A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Figure 3 for A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Figure 4 for A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Viaarxiv icon

Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone

Add code
Jun 15, 2022
Figure 1 for Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Figure 2 for Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Figure 3 for Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Figure 4 for Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Viaarxiv icon

xGQA: Cross-Lingual Visual Question Answering

Add code
Sep 13, 2021
Figure 1 for xGQA: Cross-Lingual Visual Question Answering
Figure 2 for xGQA: Cross-Lingual Visual Question Answering
Figure 3 for xGQA: Cross-Lingual Visual Question Answering
Figure 4 for xGQA: Cross-Lingual Visual Question Answering
Viaarxiv icon

MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding

Add code
Apr 26, 2021
Figure 1 for MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding
Figure 2 for MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding
Figure 3 for MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding
Figure 4 for MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding
Viaarxiv icon

AdapterHub: A Framework for Adapting Transformers

Add code
Jul 15, 2020
Figure 1 for AdapterHub: A Framework for Adapting Transformers
Figure 2 for AdapterHub: A Framework for Adapting Transformers
Figure 3 for AdapterHub: A Framework for Adapting Transformers
Figure 4 for AdapterHub: A Framework for Adapting Transformers
Viaarxiv icon

AdapterFusion: Non-Destructive Task Composition for Transfer Learning

Add code
May 01, 2020
Figure 1 for AdapterFusion: Non-Destructive Task Composition for Transfer Learning
Figure 2 for AdapterFusion: Non-Destructive Task Composition for Transfer Learning
Figure 3 for AdapterFusion: Non-Destructive Task Composition for Transfer Learning
Figure 4 for AdapterFusion: Non-Destructive Task Composition for Transfer Learning
Viaarxiv icon

What do Deep Networks Like to Read?

Add code
Sep 10, 2019
Figure 1 for What do Deep Networks Like to Read?
Figure 2 for What do Deep Networks Like to Read?
Figure 3 for What do Deep Networks Like to Read?
Figure 4 for What do Deep Networks Like to Read?
Viaarxiv icon

A Survey on Semantic Parsing

Add code
Dec 10, 2018
Figure 1 for A Survey on Semantic Parsing
Viaarxiv icon