Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

GEM: A General Evaluation Benchmark for Multimodal Tasks



Lin Su , Nan Duan , Edward Cui , Lei Ji , Chenfei Wu , Huaishao Luo , Yongfei Liu , Ming Zhong , Taroon Bharti , Arun Sacheti

* Accepted by Findings of ACL 2021 

   Access Paper or Ask Questions

M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-training



Haoyang Huang , Lin Su , Di Qi , Nan Duan , Edward Cui , Taroon Bharti , Lei Zhang , Lijuan Wang , Jianfeng Gao , Bei Liu , Jianlong Fu , Dongdong Zhang , Xin Liu , Ming Zhou

* 10 pages,2 figures 

   Access Paper or Ask Questions

XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation



Yaobo Liang , Nan Duan , Yeyun Gong , Ning Wu , Fenfei Guo , Weizhen Qi , Ming Gong , Linjun Shou , Daxin Jiang , Guihong Cao , Xiaodong Fan , Bruce Zhang , Rahul Agrawal , Edward Cui , Sining Wei , Taroon Bharti , Ying Qiao , Jiun-Hung Chen , Winnie Wu , Shuguang Liu , Fan Yang , Rangan Majumder , Ming Zhou


   Access Paper or Ask Questions

XGPT: Cross-modal Generative Pre-Training for Image Captioning



Qiaolin Xia , Haoyang Huang , Nan Duan , Dongdong Zhang , Lei Ji , Zhifang Sui , Edward Cui , Taroon Bharti , Xin Liu , Ming Zhou

* 12 pages, 3 figures, 7 tables 

   Access Paper or Ask Questions

ImageBERT: Cross-modal Pre-training with Large-scale Weak-supervised Image-Text Data



Di Qi , Lin Su , Jia Song , Edward Cui , Taroon Bharti , Arun Sacheti


   Access Paper or Ask Questions