Picture for Alan Yuille

Alan Yuille

Johns Hopkins University

In Defense of Image Pre-Training for Spatiotemporal Recognition

Add code
May 03, 2022
Figure 1 for In Defense of Image Pre-Training for Spatiotemporal Recognition
Figure 2 for In Defense of Image Pre-Training for Spatiotemporal Recognition
Figure 3 for In Defense of Image Pre-Training for Spatiotemporal Recognition
Figure 4 for In Defense of Image Pre-Training for Spatiotemporal Recognition
Viaarxiv icon

Fast AdvProp

Add code
Apr 21, 2022
Figure 1 for Fast AdvProp
Figure 2 for Fast AdvProp
Figure 3 for Fast AdvProp
Figure 4 for Fast AdvProp
Viaarxiv icon

SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering

Add code
Apr 05, 2022
Figure 1 for SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering
Figure 2 for SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering
Figure 3 for SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering
Figure 4 for SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering
Viaarxiv icon

CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation

Add code
Mar 22, 2022
Figure 1 for CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation
Figure 2 for CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation
Figure 3 for CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation
Figure 4 for CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation
Viaarxiv icon

DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection

Add code
Mar 15, 2022
Figure 1 for DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection
Figure 2 for DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection
Figure 3 for DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection
Figure 4 for DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection
Viaarxiv icon

Point-Level Region Contrast for Object Detection Pre-Training

Add code
Feb 09, 2022
Figure 1 for Point-Level Region Contrast for Object Detection Pre-Training
Figure 2 for Point-Level Region Contrast for Object Detection Pre-Training
Figure 3 for Point-Level Region Contrast for Object Detection Pre-Training
Figure 4 for Point-Level Region Contrast for Object Detection Pre-Training
Viaarxiv icon

Lite Vision Transformer with Enhanced Self-Attention

Add code
Dec 20, 2021
Figure 1 for Lite Vision Transformer with Enhanced Self-Attention
Figure 2 for Lite Vision Transformer with Enhanced Self-Attention
Figure 3 for Lite Vision Transformer with Enhanced Self-Attention
Figure 4 for Lite Vision Transformer with Enhanced Self-Attention
Viaarxiv icon

Masked Feature Prediction for Self-Supervised Visual Pre-Training

Add code
Dec 16, 2021
Figure 1 for Masked Feature Prediction for Self-Supervised Visual Pre-Training
Figure 2 for Masked Feature Prediction for Self-Supervised Visual Pre-Training
Figure 3 for Masked Feature Prediction for Self-Supervised Visual Pre-Training
Figure 4 for Masked Feature Prediction for Self-Supervised Visual Pre-Training
Viaarxiv icon

Learning from Temporal Gradient for Semi-supervised Action Recognition

Add code
Dec 06, 2021
Figure 1 for Learning from Temporal Gradient for Semi-supervised Action Recognition
Figure 2 for Learning from Temporal Gradient for Semi-supervised Action Recognition
Figure 3 for Learning from Temporal Gradient for Semi-supervised Action Recognition
Figure 4 for Learning from Temporal Gradient for Semi-supervised Action Recognition
Viaarxiv icon

MT-TransUNet: Mediating Multi-Task Tokens in Transformers for Skin Lesion Segmentation and Classification

Add code
Dec 03, 2021
Figure 1 for MT-TransUNet: Mediating Multi-Task Tokens in Transformers for Skin Lesion Segmentation and Classification
Figure 2 for MT-TransUNet: Mediating Multi-Task Tokens in Transformers for Skin Lesion Segmentation and Classification
Figure 3 for MT-TransUNet: Mediating Multi-Task Tokens in Transformers for Skin Lesion Segmentation and Classification
Figure 4 for MT-TransUNet: Mediating Multi-Task Tokens in Transformers for Skin Lesion Segmentation and Classification
Viaarxiv icon