Ross Hemsley

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023

Zorro: the masked multimodal transformer

Jan 23, 2023

NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research

Nov 15, 2022

Collaborative Augmented Reality on Smartphones via Life-long City-scale Maps

Nov 10, 2020