Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Jiasen Lu

X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers


Sep 23, 2020
Jaemin Cho, Jiasen Lu, Dustin Schwenk, Hannaneh Hajishirzi, Aniruddha Kembhavi

* EMNLP 2020 

  Access Paper or Ask Questions

Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data


Jul 24, 2020
Michael Cogswell, Jiasen Lu, Rishabh Jain, Stefan Lee, Devi Parikh, Dhruv Batra

* 19 pages, 8 figures 

  Access Paper or Ask Questions

Spatially Aware Multimodal Transformers for TextVQA


Jul 23, 2020
Yash Kant, Dhruv Batra, Peter Anderson, Alex Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal

* Accepted at European Conference on Computer Vision 2020 

  Access Paper or Ask Questions

12-in-1: Multi-Task Vision and Language Representation Learning


Dec 05, 2019
Jiasen Lu, Vedanuj Goswami, Marcus Rohrbach, Devi Parikh, Stefan Lee

* Jiasen Lu and Vedanuj Goswami contributed equally to this work 

  Access Paper or Ask Questions

ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks


Aug 06, 2019
Jiasen Lu, Dhruv Batra, Devi Parikh, Stefan Lee

* 11 pages, 5 figures 

  Access Paper or Ask Questions

Emergence of Compositional Language with Deep Generational Transmission


Apr 19, 2019
Michael Cogswell, Jiasen Lu, Stefan Lee, Devi Parikh, Dhruv Batra


  Access Paper or Ask Questions

Self-Monitoring Navigation Agent via Auxiliary Progress Estimation


Jan 10, 2019
Chih-Yao Ma, Jiasen Lu, Zuxuan Wu, Ghassan AlRegib, Zsolt Kira, Richard Socher, Caiming Xiong

* ICLR 2019, code is available at https://github.com/chihyaoma/selfmonitoring-agent 

  Access Paper or Ask Questions

Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition


Oct 01, 2018
Jianwei Yang, Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh

* 18 pages, 10 figures, Oral Presentation in Conference on Robot Learning (CoRL) 2018 

  Access Paper or Ask Questions

Graph R-CNN for Scene Graph Generation


Aug 01, 2018
Jianwei Yang, Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh

* 16 pages, ECCV 2018 camera ready 

  Access Paper or Ask Questions

Neural Baby Talk


Mar 27, 2018
Jiasen Lu, Jianwei Yang, Dhruv Batra, Devi Parikh

* 12 pages, 7 figures, CVPR 2018 

  Access Paper or Ask Questions

ParlAI: A Dialog Research Software Platform


Mar 08, 2018
Alexander H. Miller, Will Feng, Adam Fisch, Jiasen Lu, Dhruv Batra, Antoine Bordes, Devi Parikh, Jason Weston


  Access Paper or Ask Questions

Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model


Oct 27, 2017
Jiasen Lu, Anitha Kannan, Jianwei Yang, Devi Parikh, Dhruv Batra

* 11 pages, 3 figures 

  Access Paper or Ask Questions

Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning


Jun 06, 2017
Jiasen Lu, Caiming Xiong, Devi Parikh, Richard Socher

* 12 pages, 11 figures, CVPR2017 camera ready 

  Access Paper or Ask Questions

Hierarchical Question-Image Co-Attention for Visual Question Answering


Jan 19, 2017
Jiasen Lu, Jianwei Yang, Dhruv Batra, Devi Parikh

* 11 pages, 7 figures, 3 tables in 2016 Conference on Neural Information Processing Systems (NIPS) 

  Access Paper or Ask Questions

VQA: Visual Question Answering


Oct 27, 2016
Aishwarya Agrawal, Jiasen Lu, Stanislaw Antol, Margaret Mitchell, C. Lawrence Zitnick, Dhruv Batra, Devi Parikh

* The first three authors contributed equally. International Conference on Computer Vision (ICCV) 2015 

  Access Paper or Ask Questions