Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation


Feb 23, 2022
Shizhe Chen, Pierre-Louis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

History Aware Multimodal Transformer for Vision-and-Language Navigation


Oct 25, 2021
Shizhe Chen, Pierre-Louis Guhur, Cordelia Schmid, Ivan Laptev

* Accepted in NeurIPS 2021; project page at https://cshizhe.github.io/projects/vln_hamt.html 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Airbert: In-domain Pretraining for Vision-and-Language Navigation


Aug 20, 2021
Pierre-Louis Guhur, Makarand Tapaswi, Shizhe Chen, Ivan Laptev, Cordelia Schmid

* To be published on ICCV 2021. Webpage is at https://airbert-vln.github.io/ linking to our dataset, codes and models 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email