Pengchuan Zhang

Grounded Language-Image Pre-training

Dec 07, 2021
Liunian Harold Li, Pengchuan Zhang, Haotian Zhang, Jianwei Yang, Chunyuan Li, Yiwu Zhong, Lijuan Wang, Lu Yuan, Lei Zhang, Jenq-Neng Hwang, Kai-Wei Chang, Jianfeng Gao

An Empirical Study of Training End-to-End Vision-and-Language Transformers

Nov 25, 2021
Zi-Yi Dou, Yichong Xu, Zhe Gan, Jianfeng Wang, Shuohang Wang, Lijuan Wang, Chenguang Zhu, Pengchuan Zhang, Lu Yuan, Nanyun Peng, Zicheng Liu, Michael Zeng

Florence: A New Foundation Model for Computer Vision

Nov 22, 2021
Lu Yuan, Dongdong Chen, Yi-Ling Chen, Noel Codella, Xiyang Dai, Jianfeng Gao, Houdong Hu, Xuedong Huang, Boxin Li, Chunyuan Li, Ce Liu, Mengchen Liu, Zicheng Liu, Yumao Lu, Yu Shi, Lijuan Wang, Jianfeng Wang, Bin Xiao, Zhen Xiao, Jianwei Yang, Michael Zeng, Luowei Zhou, Pengchuan Zhang

Image Scene Graph Generation (SGG) Benchmark

Jul 27, 2021
Xiaotian Han, Jianwei Yang, Houdong Hu, Lei Zhang, Jianfeng Gao, Pengchuan Zhang

Focal Self-attention for Local-Global Interactions in Vision Transformers

Jul 01, 2021
Jianwei Yang, Chunyuan Li, Pengchuan Zhang, Xiyang Dai, Bin Xiao, Lu Yuan, Jianfeng Gao

Efficient Self-supervised Vision Transformers for Representation Learning

Jun 17, 2021
Chunyuan Li, Jianwei Yang, Pengchuan Zhang, Mei Gao, Bin Xiao, Xiyang Dai, Lu Yuan, Jianfeng Gao

3DB: A Framework for Debugging Computer Vision Models

Jun 07, 2021
Guillaume Leclerc, Hadi Salman, Andrew Ilyas, Sai Vemprala, Logan Engstrom, Vibhav Vineet, Kai Xiao, Pengchuan Zhang, Shibani Santurkar, Greg Yang, Ashish Kapoor, Aleksander Madry

Multiscale Invertible Generative Networks for High-Dimensional Bayesian Inference

May 12, 2021
Shumao Zhang, Pengchuan Zhang, Thomas Y. Hou

Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding

Mar 29, 2021
Pengchuan Zhang, Xiyang Dai, Jianwei Yang, Bin Xiao, Lu Yuan, Lei Zhang, Jianfeng Gao

Out-of-distribution Prediction with Invariant Risk Minimization: The Limitation and An Effective Fix

Jan 16, 2021
Ruocheng Guo, Pengchuan Zhang, Hao Liu, Emre Kiciman
