VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text


Apr 22, 2021
Hassan Akbari, Liangzhe Yuan, Rui Qian, Wei-Hong Chuang, Shih-Fu Chang, Yin Cui, Boqing Gong



Neuro-Symbolic Representations for Video Captioning: A Case for Leveraging Inductive Biases for Vision and Language


Nov 18, 2020
Hassan Akbari, Hamid Palangi, Jianwei Yang, Sudha Rao, Asli Celikyilmaz, Roland Fernandez, Paul Smolensky, Jianfeng Gao, Shih-Fu Chang



Multi-level Multimodal Common Semantic Space for Image-Phrase Grounding


Nov 28, 2018
Hassan Akbari, Svebor Karaman, Surabhi Bhargava, Brian Chen, Carl Vondrick, Shih-Fu Chang



Lip2AudSpec: Speech reconstruction from silent lip movements video


Oct 26, 2017
Hassan Akbari, Himani Arora, Liangliang Cao, Nima Mesgarani

