
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text


Apr 22, 2021
Hassan Akbari, Liangzhe Yuan, Rui Qian, Wei-Hong Chuang, Shih-Fu Chang, Yin Cui, Boqing Gong

