Alert button
Picture for Saurabh Saxena

Saurabh Saxena

Alert button

Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model

Dec 20, 2023
Saurabh Saxena, Junhwa Hur, Charles Herrmann, Deqing Sun, David J. Fleet

Viaarxiv icon

NeRFiller: Completing Scenes via Generative 3D Inpainting

Dec 07, 2023
Ethan Weber, Aleksander Hołyński, Varun Jampani, Saurabh Saxena, Noah Snavely, Abhishek Kar, Angjoo Kanazawa

Viaarxiv icon

The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation

Jun 02, 2023
Saurabh Saxena, Charles Herrmann, Junhwa Hur, Abhishek Kar, Mohammad Norouzi, Deqing Sun, David J. Fleet

Figure 1 for The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation
Figure 2 for The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation
Figure 3 for The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation
Figure 4 for The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation
Viaarxiv icon

Monocular Depth Estimation using Diffusion Models

Feb 28, 2023
Saurabh Saxena, Abhishek Kar, Mohammad Norouzi, David J. Fleet

Figure 1 for Monocular Depth Estimation using Diffusion Models
Figure 2 for Monocular Depth Estimation using Diffusion Models
Figure 3 for Monocular Depth Estimation using Diffusion Models
Figure 4 for Monocular Depth Estimation using Diffusion Models
Viaarxiv icon

A Generalist Framework for Panoptic Segmentation of Images and Videos

Oct 12, 2022
Ting Chen, Lala Li, Saurabh Saxena, Geoffrey Hinton, David J. Fleet

Figure 1 for A Generalist Framework for Panoptic Segmentation of Images and Videos
Figure 2 for A Generalist Framework for Panoptic Segmentation of Images and Videos
Figure 3 for A Generalist Framework for Panoptic Segmentation of Images and Videos
Figure 4 for A Generalist Framework for Panoptic Segmentation of Images and Videos
Viaarxiv icon

A Unified Sequence Interface for Vision Tasks

Jun 15, 2022
Ting Chen, Saurabh Saxena, Lala Li, Tsung-Yi Lin, David J. Fleet, Geoffrey Hinton

Figure 1 for A Unified Sequence Interface for Vision Tasks
Figure 2 for A Unified Sequence Interface for Vision Tasks
Figure 3 for A Unified Sequence Interface for Vision Tasks
Figure 4 for A Unified Sequence Interface for Vision Tasks
Viaarxiv icon

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

May 23, 2022
Chitwan Saharia, William Chan, Saurabh Saxena, Lala Li, Jay Whang, Emily Denton, Seyed Kamyar Seyed Ghasemipour, Burcu Karagol Ayan, S. Sara Mahdavi, Rapha Gontijo Lopes, Tim Salimans, Jonathan Ho, David J Fleet, Mohammad Norouzi

Figure 1 for Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Figure 2 for Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Figure 3 for Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Figure 4 for Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Viaarxiv icon

Pix2seq: A Language Modeling Framework for Object Detection

Sep 22, 2021
Ting Chen, Saurabh Saxena, Lala Li, David J. Fleet, Geoffrey Hinton

Figure 1 for Pix2seq: A Language Modeling Framework for Object Detection
Figure 2 for Pix2seq: A Language Modeling Framework for Object Detection
Figure 3 for Pix2seq: A Language Modeling Framework for Object Detection
Figure 4 for Pix2seq: A Language Modeling Framework for Object Detection
Viaarxiv icon

Machine learning pipeline for battery state of health estimation

Feb 01, 2021
Darius Roman, Saurabh Saxena, Valentin Robu, Michael Pecht, David Flynn

Figure 1 for Machine learning pipeline for battery state of health estimation
Figure 2 for Machine learning pipeline for battery state of health estimation
Figure 3 for Machine learning pipeline for battery state of health estimation
Figure 4 for Machine learning pipeline for battery state of health estimation
Viaarxiv icon

Non-Autoregressive Machine Translation with Latent Alignments

Apr 22, 2020
Chitwan Saharia, William Chan, Saurabh Saxena, Mohammad Norouzi

Figure 1 for Non-Autoregressive Machine Translation with Latent Alignments
Figure 2 for Non-Autoregressive Machine Translation with Latent Alignments
Figure 3 for Non-Autoregressive Machine Translation with Latent Alignments
Figure 4 for Non-Autoregressive Machine Translation with Latent Alignments
Viaarxiv icon