Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections



Chenliang Li , Haiyang Xu , Junfeng Tian , Wei Wang , Ming Yan , Bin Bi , Jiabo Ye , Hehong Chen , Guohai Xu , Zheng Cao , Ji Zhang , Songfang Huang , Fei Huang , Jingren Zhou , Luo Si


   Access Paper or Ask Questions

Achieving Human Parity on Visual Question Answering



Ming Yan , Haiyang Xu , Chenliang Li , Junfeng Tian , Bin Bi , Wei Wang , Weihua Chen , Xianzhe Xu , Fan Wang , Zheng Cao , Zhicheng Zhang , Qiyu Zhang , Ji Zhang , Songfang Huang , Fei Huang , Luo Si , Rong Jin


   Access Paper or Ask Questions

Grid-VLP: Revisiting Grid Features for Vision-Language Pre-training



Ming Yan , Haiyang Xu , Chenliang Li , Bin Bi , Junfeng Tian , Min Gui , Wei Wang


   Access Paper or Ask Questions

E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning



Haiyang Xu , Ming Yan , Chenliang Li , Bin Bi , Songfang Huang , Wenming Xiao , Fei Huang

* ACL2021 main conference 

   Access Paper or Ask Questions

StructuralLM: Structural Pre-training for Form Understanding



Chenliang Li , Bin Bi , Ming Yan , Wei Wang , Songfang Huang , Fei Huang , Luo Si

* Accepted by ACL2021 main conference 

   Access Paper or Ask Questions

SemVLP: Vision-Language Pre-training by Aligning Semantics at Multiple Levels



Chenliang Li , Ming Yan , Haiyang Xu , Fuli Luo , Wei Wang , Bin Bi , Songfang Huang

* 10 pages, 4 figures 

   Access Paper or Ask Questions

Latent Template Induction with Gumbel-CRFs



Yao Fu , Chuanqi Tan , Bin Bi , Mosha Chen , Yansong Feng , Alexander M. Rush

* NeurIPS 2020 camera ready 

   Access Paper or Ask Questions

1
2
>>