Jonathon Shlens

Capabilities of Gemini Models in Medicine

May 01, 2024
Via arXiv

On Robustness in Multimodal Learning

Apr 11, 2023
Via arXiv

STAIR: Learning Sparse Text and Image Representation in Grounded Tokens

Feb 08, 2023
Via arXiv

PseudoAugment: Learning to Use Unlabeled Data for Data Augmentation in Point Clouds

Oct 24, 2022
Via arXiv

Perceptual Grouping in Vision-Language Models

Oct 18, 2022
Via arXiv

Soft Calibration Objectives for Neural Networks

Jul 30, 2021
Via arXiv

Scene Transformer: A unified multi-task model for behavior prediction and planning

Jun 15, 2021
Via arXiv

Large Scale Interactive Motion Forecasting for Autonomous Driving: The Waymo Open Motion Dataset

Apr 20, 2021
Via arXiv

Scaling Local Self-Attention for Parameter Efficient Visual Backbones

Mar 30, 2021
Via arXiv

Scalable Scene Flow from Point Clouds in the Real World

Mar 15, 2021
Via arXiv