Picture for Marvin Ritter

Marvin Ritter

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

PaLI-X: On Scaling up a Multilingual Vision and Language Model

Add code
May 29, 2023
Figure 1 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Figure 2 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Figure 3 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Figure 4 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Viaarxiv icon

Scaling Up Models and Data with $\texttt{t5x}$ and $\texttt{seqio}$

Add code
Mar 31, 2022
Figure 1 for Scaling Up Models and Data with $\texttt{t5x}$ and $\texttt{seqio}$
Figure 2 for Scaling Up Models and Data with $\texttt{t5x}$ and $\texttt{seqio}$
Viaarxiv icon

Continental-Scale Building Detection from High Resolution Satellite Imagery

Add code
Jul 29, 2021
Figure 1 for Continental-Scale Building Detection from High Resolution Satellite Imagery
Figure 2 for Continental-Scale Building Detection from High Resolution Satellite Imagery
Figure 3 for Continental-Scale Building Detection from High Resolution Satellite Imagery
Figure 4 for Continental-Scale Building Detection from High Resolution Satellite Imagery
Viaarxiv icon

Representation learning from videos in-the-wild: An object-centric approach

Add code
Oct 06, 2020
Figure 1 for Representation learning from videos in-the-wild: An object-centric approach
Figure 2 for Representation learning from videos in-the-wild: An object-centric approach
Figure 3 for Representation learning from videos in-the-wild: An object-centric approach
Figure 4 for Representation learning from videos in-the-wild: An object-centric approach
Viaarxiv icon

Self-Supervised Learning of Video-Induced Visual Invariances

Add code
Dec 05, 2019
Figure 1 for Self-Supervised Learning of Video-Induced Visual Invariances
Figure 2 for Self-Supervised Learning of Video-Induced Visual Invariances
Figure 3 for Self-Supervised Learning of Video-Induced Visual Invariances
Figure 4 for Self-Supervised Learning of Video-Induced Visual Invariances
Viaarxiv icon

High-Fidelity Image Generation With Fewer Labels

Add code
Mar 06, 2019
Figure 1 for High-Fidelity Image Generation With Fewer Labels
Figure 2 for High-Fidelity Image Generation With Fewer Labels
Figure 3 for High-Fidelity Image Generation With Fewer Labels
Figure 4 for High-Fidelity Image Generation With Fewer Labels
Viaarxiv icon

Self-Supervised Generative Adversarial Networks

Add code
Nov 27, 2018
Figure 1 for Self-Supervised Generative Adversarial Networks
Figure 2 for Self-Supervised Generative Adversarial Networks
Figure 3 for Self-Supervised Generative Adversarial Networks
Figure 4 for Self-Supervised Generative Adversarial Networks
Viaarxiv icon

Now Playing: Continuous low-power music recognition

Add code
Nov 29, 2017
Figure 1 for Now Playing: Continuous low-power music recognition
Figure 2 for Now Playing: Continuous low-power music recognition
Viaarxiv icon