Andrew Rouditchenko

Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

Oct 14, 2021
Ian Palmer, Andrew Rouditchenko, Andrei Barbu, Boris Katz, James Glass

* Presented at Interspeech 2021. This version contains additional experiments on the Spoken ObjectNet test set.


Cross-Modal Discrete Representation Learning

Jun 10, 2021
Alexander H. Liu, SouYoung Jin, Cheng-I Jeff Lai, Andrew Rouditchenko, Aude Oliva, James Glass

* Preprint 


Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos

May 05, 2021
Brian Chen, Andrew Rouditchenko, Kevin Duarte, Hilde Kuehne, Samuel Thomas, Angie Boggust, Rameswar Panda, Brian Kingsbury, Rogerio Feris, David Harwath, James Glass, Michael Picheny, Shih-Fu Chang


AVLnet: Learning Audio-Visual Language Representations from Instructional Videos

Jun 16, 2020
Andrew Rouditchenko, Angie Boggust, David Harwath, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Rogerio Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James Glass


Label-efficient audio classification through multitask learning and self-supervision

Oct 19, 2019
Tyler Lee, Ting Gong, Suchismita Padhy, Andrew Rouditchenko, Anthony Ndirango

* Presented at ICLR 2019 Limited Labeled Data (LLD) Workshop 


Self-Supervised Audio-Visual Co-Segmentation

Apr 18, 2019
Andrew Rouditchenko, Hang Zhao, Chuang Gan, Josh McDermott, Antonio Torralba

* Accepted to ICASSP 2019 


The Sound of Pixels

Oct 14, 2018
Hang Zhao, Chuang Gan, Andrew Rouditchenko, Carl Vondrick, Josh McDermott, Antonio Torralba
