Alert button
Picture for Lijuan Wang

Lijuan Wang

Alert button

Generalized Decoding for Pixel, Image, and Language

Add code
Bookmark button
Alert button
Dec 21, 2022
Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li, Chunyuan Li, Xiyang Dai, Harkirat Behl, Jianfeng Wang, Lu Yuan, Nanyun Peng, Lijuan Wang, Yong Jae Lee, Jianfeng Gao

Figure 1 for Generalized Decoding for Pixel, Image, and Language
Figure 2 for Generalized Decoding for Pixel, Image, and Language
Figure 3 for Generalized Decoding for Pixel, Image, and Language
Figure 4 for Generalized Decoding for Pixel, Image, and Language
Viaarxiv icon

Exploring Discrete Diffusion Models for Image Captioning

Add code
Bookmark button
Alert button
Dec 09, 2022
Zixin Zhu, Yixuan Wei, Jianfeng Wang, Zhe Gan, Zheng Zhang, Le Wang, Gang Hua, Lijuan Wang, Zicheng Liu, Han Hu

Figure 1 for Exploring Discrete Diffusion Models for Image Captioning
Figure 2 for Exploring Discrete Diffusion Models for Image Captioning
Figure 3 for Exploring Discrete Diffusion Models for Image Captioning
Figure 4 for Exploring Discrete Diffusion Models for Image Captioning
Viaarxiv icon

GRiT: A Generative Region-to-text Transformer for Object Understanding

Add code
Bookmark button
Alert button
Dec 01, 2022
Jialian Wu, Jianfeng Wang, Zhengyuan Yang, Zhe Gan, Zicheng Liu, Junsong Yuan, Lijuan Wang

Figure 1 for GRiT: A Generative Region-to-text Transformer for Object Understanding
Figure 2 for GRiT: A Generative Region-to-text Transformer for Object Understanding
Figure 3 for GRiT: A Generative Region-to-text Transformer for Object Understanding
Figure 4 for GRiT: A Generative Region-to-text Transformer for Object Understanding
Viaarxiv icon

MPT: Mesh Pre-Training with Transformers for Human Pose and Mesh Reconstruction

Add code
Bookmark button
Alert button
Nov 24, 2022
Kevin Lin, Chung-Ching Lin, Lin Liang, Zicheng Liu, Lijuan Wang

Figure 1 for MPT: Mesh Pre-Training with Transformers for Human Pose and Mesh Reconstruction
Figure 2 for MPT: Mesh Pre-Training with Transformers for Human Pose and Mesh Reconstruction
Figure 3 for MPT: Mesh Pre-Training with Transformers for Human Pose and Mesh Reconstruction
Figure 4 for MPT: Mesh Pre-Training with Transformers for Human Pose and Mesh Reconstruction
Viaarxiv icon

ReCo: Region-Controlled Text-to-Image Generation

Add code
Bookmark button
Alert button
Nov 23, 2022
Zhengyuan Yang, Jianfeng Wang, Zhe Gan, Linjie Li, Kevin Lin, Chenfei Wu, Nan Duan, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang

Figure 1 for ReCo: Region-Controlled Text-to-Image Generation
Figure 2 for ReCo: Region-Controlled Text-to-Image Generation
Figure 3 for ReCo: Region-Controlled Text-to-Image Generation
Figure 4 for ReCo: Region-Controlled Text-to-Image Generation
Viaarxiv icon

Non-Contrastive Learning Meets Language-Image Pre-Training

Add code
Bookmark button
Alert button
Oct 17, 2022
Jinghao Zhou, Li Dong, Zhe Gan, Lijuan Wang, Furu Wei

Figure 1 for Non-Contrastive Learning Meets Language-Image Pre-Training
Figure 2 for Non-Contrastive Learning Meets Language-Image Pre-Training
Figure 3 for Non-Contrastive Learning Meets Language-Image Pre-Training
Figure 4 for Non-Contrastive Learning Meets Language-Image Pre-Training
Viaarxiv icon

Vision-Language Pre-training: Basics, Recent Advances, and Future Trends

Add code
Bookmark button
Alert button
Oct 17, 2022
Zhe Gan, Linjie Li, Chunyuan Li, Lijuan Wang, Zicheng Liu, Jianfeng Gao

Figure 1 for Vision-Language Pre-training: Basics, Recent Advances, and Future Trends
Figure 2 for Vision-Language Pre-training: Basics, Recent Advances, and Future Trends
Figure 3 for Vision-Language Pre-training: Basics, Recent Advances, and Future Trends
Figure 4 for Vision-Language Pre-training: Basics, Recent Advances, and Future Trends
Viaarxiv icon

Prompting GPT-3 To Be Reliable

Add code
Bookmark button
Alert button
Oct 17, 2022
Chenglei Si, Zhe Gan, Zhengyuan Yang, Shuohang Wang, Jianfeng Wang, Jordan Boyd-Graber, Lijuan Wang

Figure 1 for Prompting GPT-3 To Be Reliable
Figure 2 for Prompting GPT-3 To Be Reliable
Figure 3 for Prompting GPT-3 To Be Reliable
Figure 4 for Prompting GPT-3 To Be Reliable
Viaarxiv icon

An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling

Add code
Bookmark button
Alert button
Sep 04, 2022
Tsu-Jui Fu, Linjie Li, Zhe Gan, Kevin Lin, William Yang Wang, Lijuan Wang, Zicheng Liu

Figure 1 for An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling
Figure 2 for An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling
Figure 3 for An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling
Figure 4 for An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling
Viaarxiv icon