Picture for Radu Soricut

Radu Soricut

PaLI-3 Vision Language Models: Smaller, Faster, Stronger

Add code
Oct 17, 2023
Figure 1 for PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Figure 2 for PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Figure 3 for PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Figure 4 for PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Viaarxiv icon

CausalLM is not optimal for in-context learning

Add code
Sep 03, 2023
Figure 1 for CausalLM is not optimal for in-context learning
Figure 2 for CausalLM is not optimal for in-context learning
Figure 3 for CausalLM is not optimal for in-context learning
Figure 4 for CausalLM is not optimal for in-context learning
Viaarxiv icon

RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control

Add code
Jul 28, 2023
Viaarxiv icon

PaLI-X: On Scaling up a Multilingual Vision and Language Model

Add code
May 29, 2023
Figure 1 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Figure 2 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Figure 3 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Figure 4 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Viaarxiv icon

Connecting Vision and Language with Video Localized Narratives

Add code
Mar 15, 2023
Viaarxiv icon

Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting

Add code
Dec 13, 2022
Figure 1 for Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting
Figure 2 for Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting
Figure 3 for Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting
Figure 4 for Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting
Viaarxiv icon

Improving Robust Generalization by Direct PAC-Bayesian Bound Minimization

Add code
Nov 22, 2022
Viaarxiv icon

PaLI: A Jointly-Scaled Multilingual Language-Image Model

Add code
Sep 16, 2022
Figure 1 for PaLI: A Jointly-Scaled Multilingual Language-Image Model
Figure 2 for PaLI: A Jointly-Scaled Multilingual Language-Image Model
Figure 3 for PaLI: A Jointly-Scaled Multilingual Language-Image Model
Figure 4 for PaLI: A Jointly-Scaled Multilingual Language-Image Model
Viaarxiv icon

PreSTU: Pre-Training for Scene-Text Understanding

Add code
Sep 12, 2022
Figure 1 for PreSTU: Pre-Training for Scene-Text Understanding
Figure 2 for PreSTU: Pre-Training for Scene-Text Understanding
Figure 3 for PreSTU: Pre-Training for Scene-Text Understanding
Figure 4 for PreSTU: Pre-Training for Scene-Text Understanding
Viaarxiv icon

Towards Multi-Lingual Visual Question Answering

Add code
Sep 12, 2022
Figure 1 for Towards Multi-Lingual Visual Question Answering
Figure 2 for Towards Multi-Lingual Visual Question Answering
Figure 3 for Towards Multi-Lingual Visual Question Answering
Figure 4 for Towards Multi-Lingual Visual Question Answering
Viaarxiv icon