Picture for Pengchuan Zhang

Pengchuan Zhang

Jack

Efficient Self-supervised Vision Transformers for Representation Learning

Add code
Jun 17, 2021
Figure 1 for Efficient Self-supervised Vision Transformers for Representation Learning
Figure 2 for Efficient Self-supervised Vision Transformers for Representation Learning
Figure 3 for Efficient Self-supervised Vision Transformers for Representation Learning
Figure 4 for Efficient Self-supervised Vision Transformers for Representation Learning
Viaarxiv icon

3DB: A Framework for Debugging Computer Vision Models

Add code
Jun 07, 2021
Figure 1 for 3DB: A Framework for Debugging Computer Vision Models
Figure 2 for 3DB: A Framework for Debugging Computer Vision Models
Figure 3 for 3DB: A Framework for Debugging Computer Vision Models
Figure 4 for 3DB: A Framework for Debugging Computer Vision Models
Viaarxiv icon

Multiscale Invertible Generative Networks for High-Dimensional Bayesian Inference

Add code
May 12, 2021
Figure 1 for Multiscale Invertible Generative Networks for High-Dimensional Bayesian Inference
Figure 2 for Multiscale Invertible Generative Networks for High-Dimensional Bayesian Inference
Figure 3 for Multiscale Invertible Generative Networks for High-Dimensional Bayesian Inference
Figure 4 for Multiscale Invertible Generative Networks for High-Dimensional Bayesian Inference
Viaarxiv icon

Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding

Add code
Mar 29, 2021
Figure 1 for Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
Figure 2 for Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
Figure 3 for Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
Figure 4 for Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
Viaarxiv icon

Out-of-distribution Prediction with Invariant Risk Minimization: The Limitation and An Effective Fix

Add code
Jan 16, 2021
Figure 1 for Out-of-distribution Prediction with Invariant Risk Minimization: The Limitation and An Effective Fix
Figure 2 for Out-of-distribution Prediction with Invariant Risk Minimization: The Limitation and An Effective Fix
Figure 3 for Out-of-distribution Prediction with Invariant Risk Minimization: The Limitation and An Effective Fix
Figure 4 for Out-of-distribution Prediction with Invariant Risk Minimization: The Limitation and An Effective Fix
Viaarxiv icon

VinVL: Making Visual Representations Matter in Vision-Language Models

Add code
Jan 02, 2021
Figure 1 for VinVL: Making Visual Representations Matter in Vision-Language Models
Figure 2 for VinVL: Making Visual Representations Matter in Vision-Language Models
Figure 3 for VinVL: Making Visual Representations Matter in Vision-Language Models
Figure 4 for VinVL: Making Visual Representations Matter in Vision-Language Models
Viaarxiv icon

MiniVLM: A Smaller and Faster Vision-Language Model

Add code
Dec 13, 2020
Figure 1 for MiniVLM: A Smaller and Faster Vision-Language Model
Figure 2 for MiniVLM: A Smaller and Faster Vision-Language Model
Figure 3 for MiniVLM: A Smaller and Faster Vision-Language Model
Figure 4 for MiniVLM: A Smaller and Faster Vision-Language Model
Viaarxiv icon

MagGAN: High-Resolution Face Attribute Editing with Mask-Guided Generative Adversarial Network

Add code
Oct 03, 2020
Figure 1 for MagGAN: High-Resolution Face Attribute Editing with Mask-Guided Generative Adversarial Network
Figure 2 for MagGAN: High-Resolution Face Attribute Editing with Mask-Guided Generative Adversarial Network
Figure 3 for MagGAN: High-Resolution Face Attribute Editing with Mask-Guided Generative Adversarial Network
Figure 4 for MagGAN: High-Resolution Face Attribute Editing with Mask-Guided Generative Adversarial Network
Viaarxiv icon

Novel Human-Object Interaction Detection via Adversarial Domain Generalization

Add code
May 22, 2020
Figure 1 for Novel Human-Object Interaction Detection via Adversarial Domain Generalization
Figure 2 for Novel Human-Object Interaction Detection via Adversarial Domain Generalization
Figure 3 for Novel Human-Object Interaction Detection via Adversarial Domain Generalization
Figure 4 for Novel Human-Object Interaction Detection via Adversarial Domain Generalization
Viaarxiv icon

Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks

Add code
May 18, 2020
Figure 1 for Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Figure 2 for Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Figure 3 for Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Figure 4 for Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Viaarxiv icon