What is More Likely to Happen Next? Video-and-Language Future Event Prediction

Oct 15, 2020
Jie Lei, Licheng Yu, Tamara L. Berg, Mohit Bansal

* EMNLP 2020 (17 pages) 

Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models

May 15, 2020
Jize Cao, Zhe Gan, Yu Cheng, Licheng Yu, Yen-Chun Chen, Jingjing Liu


HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training

May 01, 2020
Linjie Li, Yen-Chun Chen, Yu Cheng, Zhe Gan, Licheng Yu, Jingjing Liu


BachGAN: High-Resolution Image Synthesis from Salient Object Layout

Mar 27, 2020
Yandong Li, Yu Cheng, Zhe Gan, Licheng Yu, Liqiang Wang, Jingjing Liu

* Accepted to CVPR 2020 

VIOLIN: A Large-Scale Dataset for Video-and-Language Inference

Mar 25, 2020
Jingzhou Liu, Wenhu Chen, Yu Cheng, Zhe Gan, Licheng Yu, Yiming Yang, Jingjing Liu

* Accepted to CVPR 2020

TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval

Jan 24, 2020
Jie Lei, Licheng Yu, Tamara L. Berg, Mohit Bansal

* 18 pages 

UNITER: Learning UNiversal Image-TExt Representations

Sep 25, 2019
Yen-Chun Chen, Linjie Li, Licheng Yu, Ahmed El Kholy, Faisal Ahmed, Zhe Gan, Yu Cheng, Jingjing Liu


TVQA+: Spatio-Temporal Grounding for Video Question Answering

Apr 25, 2019
Jie Lei, Licheng Yu, Tamara L. Berg, Mohit Bansal

* 13 pages 

Multi-Target Embodied Question Answering

Apr 09, 2019
Licheng Yu, Xinlei Chen, Georgia Gkioxari, Mohit Bansal, Tamara L. Berg, Dhruv Batra

* 10 pages, 6 figures 

Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout

Apr 08, 2019
Hao Tan, Licheng Yu, Mohit Bansal

* NAACL 2019 (12 pages) 

TVQA: Localized, Compositional Video Question Answering

Sep 05, 2018
Jie Lei, Licheng Yu, Mohit Bansal, Tamara L. Berg

* EMNLP 2018 (13 pages; Data and Leaderboard at: http://tvqa.cs.unc.edu)

A unified framework for manifold landmarking

Sep 02, 2018
Hongteng Xu, Licheng Yu, Mark Davenport, Hongyuan Zha


MAttNet: Modular Attention Network for Referring Expression Comprehension

Mar 27, 2018
Licheng Yu, Zhe Lin, Xiaohui Shen, Jimei Yang, Xin Lu, Mohit Bansal, Tamara L. Berg

* Equation of word attention fixed; MAttNet+Grabcut results added 

Hierarchically-Attentive RNN for Album Summarization and Storytelling

Aug 09, 2017
Licheng Yu, Mohit Bansal, Tamara L. Berg

* To appear at EMNLP-2017 (7 pages) 

A Joint Speaker-Listener-Reinforcer Model for Referring Expressions

Apr 17, 2017
Licheng Yu, Hao Tan, Mohit Bansal, Tamara L. Berg

* Some typos fixed; comprehension results on refcocog updated; more human evaluation results added

Detailed Garment Recovery from a Single-View Image

Sep 12, 2016
Shan Yang, Tanya Ambert, Zherong Pan, Ke Wang, Licheng Yu, Tamara Berg, Ming C. Lin

* Comparison added. Algorithm added. Equations cleaned up 

Modeling Context in Referring Expressions

Aug 10, 2016
Licheng Yu, Patrick Poirson, Shan Yang, Alexander C. Berg, Tamara L. Berg

* 19 pages, 6 figures, in ECCV 2016; authors, references and acknowledgement updated 

Visual Madlibs: Fill in the blank Image Generation and Question Answering

May 31, 2015
Licheng Yu, Eunbyung Park, Alexander C. Berg, Tamara L. Berg

* 10 pages; 8 figures; 4 tables 
