Picture for Ishan Misra

Ishan Misra

Jack

Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision

Add code
Feb 22, 2022
Figure 1 for Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision
Figure 2 for Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision
Figure 3 for Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision
Figure 4 for Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision
Viaarxiv icon

A Data-Augmentation Is Worth A Thousand Samples: Exact Quantification From Analytical Augmented Sample Moments

Add code
Feb 16, 2022
Figure 1 for A Data-Augmentation Is Worth A Thousand Samples: Exact Quantification From Analytical Augmented Sample Moments
Figure 2 for A Data-Augmentation Is Worth A Thousand Samples: Exact Quantification From Analytical Augmented Sample Moments
Figure 3 for A Data-Augmentation Is Worth A Thousand Samples: Exact Quantification From Analytical Augmented Sample Moments
Figure 4 for A Data-Augmentation Is Worth A Thousand Samples: Exact Quantification From Analytical Augmented Sample Moments
Viaarxiv icon

Omnivore: A Single Model for Many Visual Modalities

Add code
Jan 20, 2022
Figure 1 for Omnivore: A Single Model for Many Visual Modalities
Figure 2 for Omnivore: A Single Model for Many Visual Modalities
Figure 3 for Omnivore: A Single Model for Many Visual Modalities
Figure 4 for Omnivore: A Single Model for Many Visual Modalities
Viaarxiv icon

Detecting Twenty-thousand Classes using Image-level Supervision

Add code
Jan 10, 2022
Figure 1 for Detecting Twenty-thousand Classes using Image-level Supervision
Figure 2 for Detecting Twenty-thousand Classes using Image-level Supervision
Figure 3 for Detecting Twenty-thousand Classes using Image-level Supervision
Figure 4 for Detecting Twenty-thousand Classes using Image-level Supervision
Viaarxiv icon

Mask2Former for Video Instance Segmentation

Add code
Dec 20, 2021
Figure 1 for Mask2Former for Video Instance Segmentation
Figure 2 for Mask2Former for Video Instance Segmentation
Figure 3 for Mask2Former for Video Instance Segmentation
Viaarxiv icon

Masked-attention Mask Transformer for Universal Image Segmentation

Add code
Dec 10, 2021
Figure 1 for Masked-attention Mask Transformer for Universal Image Segmentation
Figure 2 for Masked-attention Mask Transformer for Universal Image Segmentation
Figure 3 for Masked-attention Mask Transformer for Universal Image Segmentation
Figure 4 for Masked-attention Mask Transformer for Universal Image Segmentation
Viaarxiv icon

Frame Averaging for Invariant and Equivariant Network Design

Add code
Oct 07, 2021
Figure 1 for Frame Averaging for Invariant and Equivariant Network Design
Figure 2 for Frame Averaging for Invariant and Equivariant Network Design
Figure 3 for Frame Averaging for Invariant and Equivariant Network Design
Figure 4 for Frame Averaging for Invariant and Equivariant Network Design
Viaarxiv icon

An End-to-End Transformer Model for 3D Object Detection

Add code
Sep 16, 2021
Figure 1 for An End-to-End Transformer Model for 3D Object Detection
Figure 2 for An End-to-End Transformer Model for 3D Object Detection
Figure 3 for An End-to-End Transformer Model for 3D Object Detection
Figure 4 for An End-to-End Transformer Model for 3D Object Detection
Viaarxiv icon

Emerging Properties in Self-Supervised Vision Transformers

Add code
May 24, 2021
Figure 1 for Emerging Properties in Self-Supervised Vision Transformers
Figure 2 for Emerging Properties in Self-Supervised Vision Transformers
Figure 3 for Emerging Properties in Self-Supervised Vision Transformers
Figure 4 for Emerging Properties in Self-Supervised Vision Transformers
Viaarxiv icon

3D Spatial Recognition without Spatially Labeled 3D

Add code
May 13, 2021
Figure 1 for 3D Spatial Recognition without Spatially Labeled 3D
Figure 2 for 3D Spatial Recognition without Spatially Labeled 3D
Figure 3 for 3D Spatial Recognition without Spatially Labeled 3D
Figure 4 for 3D Spatial Recognition without Spatially Labeled 3D
Viaarxiv icon