Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Licheng Yu

What is More Likely to Happen Next? Video-and-Language Future Event Prediction


Oct 15, 2020
Jie Lei, Licheng Yu, Tamara L. Berg, Mohit Bansal

* EMNLP 2020 (17 pages) 

  Access Paper or Ask Questions

Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models


May 15, 2020
Jize Cao, Zhe Gan, Yu Cheng, Licheng Yu, Yen-Chun Chen, Jingjing Liu


  Access Paper or Ask Questions

HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training


May 01, 2020
Linjie Li, Yen-Chun Chen, Yu Cheng, Zhe Gan, Licheng Yu, Jingjing Liu


  Access Paper or Ask Questions

BachGAN: High-Resolution Image Synthesis from Salient Object Layout


Mar 27, 2020
Yandong Li, Yu Cheng, Zhe Gan, Licheng Yu, Liqiang Wang, Jingjing Liu

* Accepted to CVPR 2020 

  Access Paper or Ask Questions

VIOLIN: A Large-Scale Dataset for Video-and-Language Inference


Mar 25, 2020
Jingzhou Liu, Wenhu Chen, Yu Cheng, Zhe Gan, Licheng Yu, Yiming Yang, Jingjing Liu

* Accepted to CVPR2020 

  Access Paper or Ask Questions

TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval


Jan 24, 2020
Jie Lei, Licheng Yu, Tamara L. Berg, Mohit Bansal

* 18 pages 

  Access Paper or Ask Questions

UNITER: Learning UNiversal Image-TExt Representations


Sep 25, 2019
Yen-Chun Chen, Linjie Li, Licheng Yu, Ahmed El Kholy, Faisal Ahmed, Zhe Gan, Yu Cheng, Jingjing Liu


  Access Paper or Ask Questions

TVQA+: Spatio-Temporal Grounding for Video Question Answering


Apr 25, 2019
Jie Lei, Licheng Yu, Tamara L. Berg, Mohit Bansal

* 13 pages 

  Access Paper or Ask Questions

Multi-Target Embodied Question Answering


Apr 09, 2019
Licheng Yu, Xinlei Chen, Georgia Gkioxari, Mohit Bansal, Tamara L. Berg, Dhruv Batra

* 10 pages, 6 figures 

  Access Paper or Ask Questions

Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout


Apr 08, 2019
Hao Tan, Licheng Yu, Mohit Bansal

* NAACL 2019 (12 pages) 

  Access Paper or Ask Questions

TVQA: Localized, Compositional Video Question Answering


Sep 05, 2018
Jie Lei, Licheng Yu, Mohit Bansal, Tamara L. Berg

* EMNLP 2018 (13 pages; Data and Leaderboard at: http://tvqa.cs.unc.edu

  Access Paper or Ask Questions

A unified framework for manifold landmarking


Sep 02, 2018
Hongteng Xu, Licheng Yu, Mark Davenport, Hongyuan Zha


  Access Paper or Ask Questions

MAttNet: Modular Attention Network for Referring Expression Comprehension


Mar 27, 2018
Licheng Yu, Zhe Lin, Xiaohui Shen, Jimei Yang, Xin Lu, Mohit Bansal, Tamara L. Berg

* Equation of word attention fixed; MAttNet+Grabcut results added 

  Access Paper or Ask Questions

Hierarchically-Attentive RNN for Album Summarization and Storytelling


Aug 09, 2017
Licheng Yu, Mohit Bansal, Tamara L. Berg

* To appear at EMNLP-2017 (7 pages) 

  Access Paper or Ask Questions

A Joint Speaker-Listener-Reinforcer Model for Referring Expressions


Apr 17, 2017
Licheng Yu, Hao Tan, Mohit Bansal, Tamara L. Berg

* Some typo fixed; comprehension results on refcocog updated; more human evaluation results added 

  Access Paper or Ask Questions

Detailed Garment Recovery from a Single-View Image


Sep 12, 2016
Shan Yang, Tanya Ambert, Zherong Pan, Ke Wang, Licheng Yu, Tamara Berg, Ming C. Lin

* Comparison added. Algorithm added. Equations cleaned up 

  Access Paper or Ask Questions

Modeling Context in Referring Expressions


Aug 10, 2016
Licheng Yu, Patrick Poirson, Shan Yang, Alexander C. Berg, Tamara L. Berg

* 19 pages, 6 figures, in ECCV 2016; authors, references and acknowledgement updated 

  Access Paper or Ask Questions

Visual Madlibs: Fill in the blank Image Generation and Question Answering


May 31, 2015
Licheng Yu, Eunbyung Park, Alexander C. Berg, Tamara L. Berg

* 10 pages; 8 figures; 4 tables 

  Access Paper or Ask Questions