Alert button
Picture for Lu Yuan

Lu Yuan

Alert button

CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks

Add code
Bookmark button
Alert button
Jan 15, 2022
Zhecan Wang, Noel Codella, Yen-Chun Chen, Luowei Zhou, Jianwei Yang, Xiyang Dai, Bin Xiao, Haoxuan You, Shih-Fu Chang, Lu Yuan

Figure 1 for CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Figure 2 for CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Figure 3 for CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Figure 4 for CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Viaarxiv icon

Online Multi-Object Tracking with Unsupervised Re-Identification Learning and Occlusion Estimation

Add code
Bookmark button
Alert button
Jan 04, 2022
Qiankun Liu, Dongdong Chen, Qi Chu, Lu Yuan, Bin Liu, Lei Zhang, Nenghai Yu

Figure 1 for Online Multi-Object Tracking with Unsupervised Re-Identification Learning and Occlusion Estimation
Figure 2 for Online Multi-Object Tracking with Unsupervised Re-Identification Learning and Occlusion Estimation
Figure 3 for Online Multi-Object Tracking with Unsupervised Re-Identification Learning and Occlusion Estimation
Figure 4 for Online Multi-Object Tracking with Unsupervised Re-Identification Learning and Occlusion Estimation
Viaarxiv icon

Vector Quantized Diffusion Model for Text-to-Image Synthesis

Add code
Bookmark button
Alert button
Dec 20, 2021
Shuyang Gu, Dong Chen, Jianmin Bao, Fang Wen, Bo Zhang, Dongdong Chen, Lu Yuan, Baining Guo

Figure 1 for Vector Quantized Diffusion Model for Text-to-Image Synthesis
Figure 2 for Vector Quantized Diffusion Model for Text-to-Image Synthesis
Figure 3 for Vector Quantized Diffusion Model for Text-to-Image Synthesis
Figure 4 for Vector Quantized Diffusion Model for Text-to-Image Synthesis
Viaarxiv icon

RegionCLIP: Region-based Language-Image Pretraining

Add code
Bookmark button
Alert button
Dec 16, 2021
Yiwu Zhong, Jianwei Yang, Pengchuan Zhang, Chunyuan Li, Noel Codella, Liunian Harold Li, Luowei Zhou, Xiyang Dai, Lu Yuan, Yin Li, Jianfeng Gao

Figure 1 for RegionCLIP: Region-based Language-Image Pretraining
Figure 2 for RegionCLIP: Region-based Language-Image Pretraining
Figure 3 for RegionCLIP: Region-based Language-Image Pretraining
Figure 4 for RegionCLIP: Region-based Language-Image Pretraining
Viaarxiv icon

HairCLIP: Design Your Hair by Text and Reference Image

Add code
Bookmark button
Alert button
Dec 09, 2021
Tianyi Wei, Dongdong Chen, Wenbo Zhou, Jing Liao, Zhentao Tan, Lu Yuan, Weiming Zhang, Nenghai Yu

Figure 1 for HairCLIP: Design Your Hair by Text and Reference Image
Figure 2 for HairCLIP: Design Your Hair by Text and Reference Image
Figure 3 for HairCLIP: Design Your Hair by Text and Reference Image
Figure 4 for HairCLIP: Design Your Hair by Text and Reference Image
Viaarxiv icon

Grounded Language-Image Pre-training

Add code
Bookmark button
Alert button
Dec 07, 2021
Liunian Harold Li, Pengchuan Zhang, Haotian Zhang, Jianwei Yang, Chunyuan Li, Yiwu Zhong, Lijuan Wang, Lu Yuan, Lei Zhang, Jenq-Neng Hwang, Kai-Wei Chang, Jianfeng Gao

Figure 1 for Grounded Language-Image Pre-training
Figure 2 for Grounded Language-Image Pre-training
Figure 3 for Grounded Language-Image Pre-training
Figure 4 for Grounded Language-Image Pre-training
Viaarxiv icon

General Facial Representation Learning in a Visual-Linguistic Manner

Add code
Bookmark button
Alert button
Dec 06, 2021
Yinglin Zheng, Hao Yang, Ting Zhang, Jianmin Bao, Dongdong Chen, Yangyu Huang, Lu Yuan, Dong Chen, Ming Zeng, Fang Wen

Figure 1 for General Facial Representation Learning in a Visual-Linguistic Manner
Figure 2 for General Facial Representation Learning in a Visual-Linguistic Manner
Figure 3 for General Facial Representation Learning in a Visual-Linguistic Manner
Figure 4 for General Facial Representation Learning in a Visual-Linguistic Manner
Viaarxiv icon

BEVT: BERT Pretraining of Video Transformers

Add code
Bookmark button
Alert button
Dec 02, 2021
Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Yu-Gang Jiang, Luowei Zhou, Lu Yuan

Figure 1 for BEVT: BERT Pretraining of Video Transformers
Figure 2 for BEVT: BERT Pretraining of Video Transformers
Figure 3 for BEVT: BERT Pretraining of Video Transformers
Figure 4 for BEVT: BERT Pretraining of Video Transformers
Viaarxiv icon

An Empirical Study of Training End-to-End Vision-and-Language Transformers

Add code
Bookmark button
Alert button
Nov 25, 2021
Zi-Yi Dou, Yichong Xu, Zhe Gan, Jianfeng Wang, Shuohang Wang, Lijuan Wang, Chenguang Zhu, Pengchuan Zhang, Lu Yuan, Nanyun Peng, Zicheng Liu, Michael Zeng

Figure 1 for An Empirical Study of Training End-to-End Vision-and-Language Transformers
Figure 2 for An Empirical Study of Training End-to-End Vision-and-Language Transformers
Figure 3 for An Empirical Study of Training End-to-End Vision-and-Language Transformers
Figure 4 for An Empirical Study of Training End-to-End Vision-and-Language Transformers
Viaarxiv icon