Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation


Feb 23, 2022
Shizhe Chen, Pierre-Louis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

History Aware Multimodal Transformer for Vision-and-Language Navigation


Oct 25, 2021
Shizhe Chen, Pierre-Louis Guhur, Cordelia Schmid, Ivan Laptev

* Accepted in NeurIPS 2021; project page at https://cshizhe.github.io/projects/vln_hamt.html 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training


Aug 25, 2021
Yuqing Song, Shizhe Chen, Qin Jin, Wei Luo, Jun Xie, Fei Huang

* Accepted as Oral by ACMMM 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Airbert: In-domain Pretraining for Vision-and-Language Navigation


Aug 20, 2021
Pierre-Louis Guhur, Makarand Tapaswi, Shizhe Chen, Ivan Laptev, Cordelia Schmid

* To be published on ICCV 2021. Webpage is at https://airbert-vln.github.io/ linking to our dataset, codes and models 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Elaborative Rehearsal for Zero-shot Action Recognition


Aug 18, 2021
Shizhe Chen, Dong Huang

* Accepted by ICCV 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Question-controlled Text-aware Image Captioning


Aug 04, 2021
Anwen Hu, Shizhe Chen, Qin Jin

* 10 pages, 8 figures, to appear in ACM MM 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

ICECAP: Information Concentrated Entity-aware Image Captioning


Aug 04, 2021
Anwen Hu, Shizhe Chen, Qin Jin

* 9 pages, 7 figures, ACM MM 2020 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Team RUC_AIM3 Technical Report at ActivityNet 2021: Entities Object Localization


Jun 11, 2021
Ludan Ruan, Jieting Chen, Yuqing Song, Shizhe Chen, Qin Jin

* 6 pages, 4 figures 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Towards Diverse Paragraph Captioning for Untrimmed Videos


May 30, 2021
Yuqing Song, Shizhe Chen, Qin Jin

* Accepted by CVPR 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
3
>>