Picture for Mario Lucic

Mario Lucic

PaLI-X: On Scaling up a Multilingual Vision and Language Model

Add code
May 29, 2023
Figure 1 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Figure 2 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Figure 3 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Figure 4 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Viaarxiv icon

Audiovisual Masked Autoencoders

Add code
Dec 09, 2022
Figure 1 for Audiovisual Masked Autoencoders
Figure 2 for Audiovisual Masked Autoencoders
Figure 3 for Audiovisual Masked Autoencoders
Figure 4 for Audiovisual Masked Autoencoders
Viaarxiv icon

RUST: Latent Neural Scene Representations from Unposed Imagery

Add code
Nov 25, 2022
Figure 1 for RUST: Latent Neural Scene Representations from Unposed Imagery
Figure 2 for RUST: Latent Neural Scene Representations from Unposed Imagery
Figure 3 for RUST: Latent Neural Scene Representations from Unposed Imagery
Figure 4 for RUST: Latent Neural Scene Representations from Unposed Imagery
Viaarxiv icon

VCT: A Video Compression Transformer

Add code
Jun 15, 2022
Figure 1 for VCT: A Video Compression Transformer
Figure 2 for VCT: A Video Compression Transformer
Figure 3 for VCT: A Video Compression Transformer
Figure 4 for VCT: A Video Compression Transformer
Viaarxiv icon

Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations

Add code
Nov 29, 2021
Figure 1 for Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations
Figure 2 for Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations
Figure 3 for Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations
Figure 4 for Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations
Viaarxiv icon

PolyViT: Co-training Vision Transformers on Images, Videos and Audio

Add code
Nov 25, 2021
Figure 1 for PolyViT: Co-training Vision Transformers on Images, Videos and Audio
Figure 2 for PolyViT: Co-training Vision Transformers on Images, Videos and Audio
Figure 3 for PolyViT: Co-training Vision Transformers on Images, Videos and Audio
Figure 4 for PolyViT: Co-training Vision Transformers on Images, Videos and Audio
Viaarxiv icon

Revisiting the Calibration of Modern Neural Networks

Add code
Jun 15, 2021
Figure 1 for Revisiting the Calibration of Modern Neural Networks
Figure 2 for Revisiting the Calibration of Modern Neural Networks
Figure 3 for Revisiting the Calibration of Modern Neural Networks
Figure 4 for Revisiting the Calibration of Modern Neural Networks
Viaarxiv icon

A Near-Optimal Algorithm for Debiasing Trained Machine Learning Models

Add code
Jun 06, 2021
Figure 1 for A Near-Optimal Algorithm for Debiasing Trained Machine Learning Models
Figure 2 for A Near-Optimal Algorithm for Debiasing Trained Machine Learning Models
Figure 3 for A Near-Optimal Algorithm for Debiasing Trained Machine Learning Models
Figure 4 for A Near-Optimal Algorithm for Debiasing Trained Machine Learning Models
Viaarxiv icon

MLP-Mixer: An all-MLP Architecture for Vision

Add code
May 17, 2021
Figure 1 for MLP-Mixer: An all-MLP Architecture for Vision
Figure 2 for MLP-Mixer: An all-MLP Architecture for Vision
Figure 3 for MLP-Mixer: An all-MLP Architecture for Vision
Figure 4 for MLP-Mixer: An all-MLP Architecture for Vision
Viaarxiv icon

SI-Score: An image dataset for fine-grained analysis of robustness to object location, rotation and size

Add code
Apr 09, 2021
Figure 1 for SI-Score: An image dataset for fine-grained analysis of robustness to object location, rotation and size
Figure 2 for SI-Score: An image dataset for fine-grained analysis of robustness to object location, rotation and size
Figure 3 for SI-Score: An image dataset for fine-grained analysis of robustness to object location, rotation and size
Figure 4 for SI-Score: An image dataset for fine-grained analysis of robustness to object location, rotation and size
Viaarxiv icon