Alert button
Picture for Lijuan Wang

Lijuan Wang

Alert button

NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis

Add code
Bookmark button
Alert button
Jul 20, 2022
Chenfei Wu, Jian Liang, Xiaowei Hu, Zhe Gan, Jianfeng Wang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan

Figure 1 for NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
Figure 2 for NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
Figure 3 for NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
Figure 4 for NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
Viaarxiv icon

Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone

Add code
Bookmark button
Alert button
Jun 15, 2022
Zi-Yi Dou, Aishwarya Kamath, Zhe Gan, Pengchuan Zhang, Jianfeng Wang, Linjie Li, Zicheng Liu, Ce Liu, Yann LeCun, Nanyun Peng, Jianfeng Gao, Lijuan Wang

Figure 1 for Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Figure 2 for Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Figure 3 for Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Figure 4 for Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Viaarxiv icon

LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling

Add code
Bookmark button
Alert button
Jun 14, 2022
Linjie Li, Zhe Gan, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Ce Liu, Lijuan Wang

Figure 1 for LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling
Figure 2 for LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling
Figure 3 for LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling
Figure 4 for LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling
Viaarxiv icon

GLIPv2: Unifying Localization and Vision-Language Understanding

Add code
Bookmark button
Alert button
Jun 12, 2022
Haotian Zhang, Pengchuan Zhang, Xiaowei Hu, Yen-Chun Chen, Liunian Harold Li, Xiyang Dai, Lijuan Wang, Lu Yuan, Jenq-Neng Hwang, Jianfeng Gao

Figure 1 for GLIPv2: Unifying Localization and Vision-Language Understanding
Figure 2 for GLIPv2: Unifying Localization and Vision-Language Understanding
Figure 3 for GLIPv2: Unifying Localization and Vision-Language Understanding
Figure 4 for GLIPv2: Unifying Localization and Vision-Language Understanding
Viaarxiv icon

GIT: A Generative Image-to-text Transformer for Vision and Language

Add code
Bookmark button
Alert button
May 31, 2022
Jianfeng Wang, Zhengyuan Yang, Xiaowei Hu, Linjie Li, Kevin Lin, Zhe Gan, Zicheng Liu, Ce Liu, Lijuan Wang

Figure 1 for GIT: A Generative Image-to-text Transformer for Vision and Language
Figure 2 for GIT: A Generative Image-to-text Transformer for Vision and Language
Figure 3 for GIT: A Generative Image-to-text Transformer for Vision and Language
Figure 4 for GIT: A Generative Image-to-text Transformer for Vision and Language
Viaarxiv icon

Cross-modal Representation Learning for Zero-shot Action Recognition

Add code
Bookmark button
Alert button
May 03, 2022
Chung-Ching Lin, Kevin Lin, Linjie Li, Lijuan Wang, Zicheng Liu

Figure 1 for Cross-modal Representation Learning for Zero-shot Action Recognition
Figure 2 for Cross-modal Representation Learning for Zero-shot Action Recognition
Figure 3 for Cross-modal Representation Learning for Zero-shot Action Recognition
Figure 4 for Cross-modal Representation Learning for Zero-shot Action Recognition
Viaarxiv icon

K-LITE: Learning Transferable Visual Models with External Knowledge

Add code
Bookmark button
Alert button
Apr 20, 2022
Sheng Shen, Chunyuan Li, Xiaowei Hu, Yujia Xie, Jianwei Yang, Pengchuan Zhang, Anna Rohrbach, Zhe Gan, Lijuan Wang, Lu Yuan, Ce Liu, Kurt Keutzer, Trevor Darrell, Jianfeng Gao

Figure 1 for K-LITE: Learning Transferable Visual Models with External Knowledge
Figure 2 for K-LITE: Learning Transferable Visual Models with External Knowledge
Figure 3 for K-LITE: Learning Transferable Visual Models with External Knowledge
Figure 4 for K-LITE: Learning Transferable Visual Models with External Knowledge
Viaarxiv icon

The Overlooked Classifier in Human-Object Interaction Recognition

Add code
Bookmark button
Alert button
Mar 10, 2022
Ying Jin, Yinpeng Chen, Lijuan Wang, Jianfeng Wang, Pei Yu, Lin Liang, Jenq-Neng Hwang, Zicheng Liu

Figure 1 for The Overlooked Classifier in Human-Object Interaction Recognition
Figure 2 for The Overlooked Classifier in Human-Object Interaction Recognition
Figure 3 for The Overlooked Classifier in Human-Object Interaction Recognition
Figure 4 for The Overlooked Classifier in Human-Object Interaction Recognition
Viaarxiv icon

Decoupling Object Detection from Human-Object Interaction Recognition

Add code
Bookmark button
Alert button
Dec 13, 2021
Ying Jin, Yinpeng Chen, Lijuan Wang, Jianfeng Wang, Pei Yu, Lin Liang, Jenq-Neng Hwang, Zicheng Liu

Figure 1 for Decoupling Object Detection from Human-Object Interaction Recognition
Figure 2 for Decoupling Object Detection from Human-Object Interaction Recognition
Figure 3 for Decoupling Object Detection from Human-Object Interaction Recognition
Figure 4 for Decoupling Object Detection from Human-Object Interaction Recognition
Viaarxiv icon

Injecting Semantic Concepts into End-to-End Image Captioning

Add code
Bookmark button
Alert button
Dec 09, 2021
Zhiyuan Fang, Jianfeng Wang, Xiaowei Hu, Lin Liang, Zhe Gan, Lijuan Wang, Yezhou Yang, Zicheng Liu

Figure 1 for Injecting Semantic Concepts into End-to-End Image Captioning
Figure 2 for Injecting Semantic Concepts into End-to-End Image Captioning
Figure 3 for Injecting Semantic Concepts into End-to-End Image Captioning
Figure 4 for Injecting Semantic Concepts into End-to-End Image Captioning
Viaarxiv icon