David Harwath

Text-Free Image-to-Speech Synthesis Using Learned Segmental Units


Dec 31, 2020
Wei-Ning Hsu, David Harwath, Christopher Song, James Glass


AVLnet: Learning Audio-Visual Language Representations from Instructional Videos


Jun 16, 2020
Andrew Rouditchenko, Angie Boggust, David Harwath, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Rogerio Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James Glass


Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech


Nov 21, 2019
David Harwath, Wei-Ning Hsu, James Glass


Transfer Learning from Audio-Visual Grounding to Speech Recognition


Jul 09, 2019
Wei-Ning Hsu, David Harwath, James Glass

* Accepted to Interspeech 2019. 4 pages, 2 figures 

Towards Visually Grounded Sub-Word Speech Unit Discovery


Feb 21, 2019
David Harwath, James Glass

* Accepted to ICASSP 2019 

Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech


Apr 09, 2018
David Harwath, Galen Chuang, James Glass

* To appear at ICASSP 2018

Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input


Apr 04, 2018
David Harwath, Adrià Recasens, Dídac Surís, Galen Chuang, Antonio Torralba, James Glass


Learning Modality-Invariant Representations for Speech and Images


Dec 11, 2017
Kenneth Leidal, David Harwath, James Glass


Learning Word-Like Units from Joint Audio-Visual Analysis


May 24, 2017
David Harwath, James R. Glass


Deep Multimodal Semantic Embeddings for Speech and Images


Nov 11, 2015
David Harwath, James Glass

