Licheng Yu

VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation

Jun 08, 2021
Linjie Li, Jie Lei, Zhe Gan, Licheng Yu, Yen-Chun Chen, Rohit Pillai, Yu Cheng, Luowei Zhou, Xin Eric Wang, William Yang Wang, Tamara Lee Berg, Mohit Bansal, Jingjing Liu, Lijuan Wang, Zicheng Liu

Connecting What to Say With Where to Look by Modeling Human Attention Traces

May 12, 2021
Zihang Meng, Licheng Yu, Ning Zhang, Tamara Berg, Babak Damavandi, Vikas Singh, Amy Bearman

What is More Likely to Happen Next? Video-and-Language Future Event Prediction

Oct 15, 2020
Jie Lei, Licheng Yu, Tamara L. Berg, Mohit Bansal

Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models

May 15, 2020
Jize Cao, Zhe Gan, Yu Cheng, Licheng Yu, Yen-Chun Chen, Jingjing Liu

HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training

May 01, 2020
Linjie Li, Yen-Chun Chen, Yu Cheng, Zhe Gan, Licheng Yu, Jingjing Liu

BachGAN: High-Resolution Image Synthesis from Salient Object Layout

Mar 27, 2020
Yandong Li, Yu Cheng, Zhe Gan, Licheng Yu, Liqiang Wang, Jingjing Liu

VIOLIN: A Large-Scale Dataset for Video-and-Language Inference

Mar 25, 2020
Jingzhou Liu, Wenhu Chen, Yu Cheng, Zhe Gan, Licheng Yu, Yiming Yang, Jingjing Liu

TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval

Jan 24, 2020
Jie Lei, Licheng Yu, Tamara L. Berg, Mohit Bansal

UNITER: Learning UNiversal Image-TExt Representations

Sep 25, 2019
Yen-Chun Chen, Linjie Li, Licheng Yu, Ahmed El Kholy, Faisal Ahmed, Zhe Gan, Yu Cheng, Jingjing Liu
