Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Analyzing Acoustic Word Embeddings from Pre-trained Self-supervised Speech Models


Oct 28, 2022
Ramon Sanabria, Hao Tang, Sharon Goldwater

Add code

* Submitted to IEEE ICASSP 2023 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Measuring the Impact of Individual Domain Factors in Self-Supervised Pre-Training


Mar 02, 2022
Ramon Sanabria, Wei-Ning Hsu, Alexei Baevski, Michael Auli

Add code

* Submitted to Insterspeech 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

On the Difficulty of Segmenting Words with Attention


Sep 21, 2021
Ramon Sanabria, Hao Tang, Sharon Goldwater

Add code

* Accepted at the "Workshop on Insights from Negative Results in NLP" (EMNLP 2021) 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval


Apr 08, 2021
Ramon Sanabria, Austin Waters, Jason Baldridge

Add code

* Submitted to INTERSPEECH 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Multimodal Speech Recognition with Unstructured Audio Masking


Oct 16, 2020
Tejas Srinivasan, Ramon Sanabria, Florian Metze, Desmond Elliott

Add code

* Accepted to NLP Beyond Text workshop, EMNLP 2020 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Fine-Grained Grounding for Multimodal Speech Recognition


Oct 05, 2020
Tejas Srinivasan, Ramon Sanabria, Florian Metze, Desmond Elliott

Add code

* Accepted to Findings of EMNLP 2020 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Looking Enhances Listening: Recovering Missing Speech Using Images


Feb 13, 2020
Tejas Srinivasan, Ramon Sanabria, Florian Metze

Add code

* Accepted to ICASSP 2020 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Multitask Learning For Different Subword Segmentations In Neural Machine Translation


Oct 27, 2019
Tejas Srinivasan, Ramon Sanabria, Florian Metze

Add code

* Accepted to 16th International Workshop on Spoken Language Translation (IWSLT) 2019 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions


Jun 30, 2019
Tejas Srinivasan, Ramon Sanabria, Florian Metze

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
>>