Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Ramon Sanabria

Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval


Apr 08, 2021
Ramon Sanabria, Austin Waters, Jason Baldridge

* Submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

Multimodal Speech Recognition with Unstructured Audio Masking


Oct 16, 2020
Tejas Srinivasan, Ramon Sanabria, Florian Metze, Desmond Elliott

* Accepted to NLP Beyond Text workshop, EMNLP 2020 

  Access Paper or Ask Questions

Fine-Grained Grounding for Multimodal Speech Recognition


Oct 05, 2020
Tejas Srinivasan, Ramon Sanabria, Florian Metze, Desmond Elliott

* Accepted to Findings of EMNLP 2020 

  Access Paper or Ask Questions

Looking Enhances Listening: Recovering Missing Speech Using Images


Feb 13, 2020
Tejas Srinivasan, Ramon Sanabria, Florian Metze

* Accepted to ICASSP 2020 

  Access Paper or Ask Questions

Multitask Learning For Different Subword Segmentations In Neural Machine Translation


Oct 27, 2019
Tejas Srinivasan, Ramon Sanabria, Florian Metze

* Accepted to 16th International Workshop on Spoken Language Translation (IWSLT) 2019 

  Access Paper or Ask Questions

Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions


Jun 30, 2019
Tejas Srinivasan, Ramon Sanabria, Florian Metze


  Access Paper or Ask Questions

Multimodal Grounding for Sequence-to-Sequence Speech Recognition


Nov 09, 2018
Ozan Caglayan, Ramon Sanabria, Shruti Palaskar, Loïc Barrault, Florian Metze

* Submitted to ICASSP 2019 

  Access Paper or Ask Questions

How2: A Large-scale Dataset for Multimodal Language Understanding


Nov 01, 2018
Ramon Sanabria, Ozan Caglayan, Shruti Palaskar, Desmond Elliott, Loïc Barrault, Lucia Specia, Florian Metze


  Access Paper or Ask Questions

Hierarchical Multi Task Learning With CTC


Jul 25, 2018
Ramon Sanabria, Florian Metze

* Submitted to SLT 2018 

  Access Paper or Ask Questions

Subword and Crossword Units for CTC Acoustic Models


Jun 18, 2018
Thomas Zenkel, Ramon Sanabria, Florian Metze, Alex Waibel

* Current version accepted at Interspeech 2018 

  Access Paper or Ask Questions

End-to-End Multimodal Speech Recognition


Apr 25, 2018
Shruti Palaskar, Ramon Sanabria, Florian Metze

* 5 pages, 5 figures, Accepted at IEEE International Conference on Acoustics, Speech and Signal Processing 2018 (ICASSP 2018) 

  Access Paper or Ask Questions

Sequence-based Multi-lingual Low Resource Speech Recognition


Mar 06, 2018
Siddharth Dalmia, Ramon Sanabria, Florian Metze, Alan W. Black

* 5 pages, 5 figures, to appear in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018) 

  Access Paper or Ask Questions

Comparison of Decoding Strategies for CTC Acoustic Models


Aug 15, 2017
Thomas Zenkel, Ramon Sanabria, Florian Metze, Jan Niehues, Matthias Sperber, Sebastian Stüker, Alex Waibel

* 5 pages. To appear in Interspeech 2017 

  Access Paper or Ask Questions

Robust end-to-end deep audiovisual speech recognition


Nov 21, 2016
Ramon Sanabria, Florian Metze, Fernando De La Torre


  Access Paper or Ask Questions