Self-Supervised Speech Representation Learning: A Review



Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, Jakob D. Havtorn, Joakim Edin, Christian Igel, Katrin Kirchhoff, Shang-Wen Li, Karen Livescu, Lars Maaløe, Tara N. Sainath, Shinji Watanabe



Benchmarking Generative Latent Variable Models for Speech



Jakob D. Havtorn, Lasse Borgholt, Søren Hauberg, Jes Frellsen, Lars Maaløe

* Accepted at the 2022 ICLR workshop on Deep Generative Models for Highly Structured Data (https://deep-gen-struct.github.io)


A Brief Overview of Unsupervised Neural Speech Representation Learning



Lasse Borgholt, Jakob Drachmann Havtorn, Joakim Edin, Lars Maaløe, Christian Igel

* The 2nd Workshop on Self-supervised Learning for Audio and Speech Processing (SAS) at AAAI 


Do We Still Need Automatic Speech Recognition for Spoken Language Understanding?



Lasse Borgholt, Jakob Drachmann Havtorn, Mostafa Abdou, Joakim Edin, Lars Maaløe, Anders Søgaard, Christian Igel

* Under review as a conference paper at ICASSP 2022 


Do End-to-End Speech Recognition Models Care About Context?



Lasse Borgholt, Jakob Drachmann Havtorn, Željko Agić, Anders Søgaard, Lars Maaløe, Christian Igel

* Published in the proceedings of INTERSPEECH 2020, pp. 4352-4356 


On Scaling Contrastive Representations for Low-Resource Speech Recognition



Lasse Borgholt, Tycho Max Sylvester Tax, Jakob Drachmann Havtorn, Lars Maaløe, Christian Igel

* © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.


MultiQT: Multimodal Learning for Real-Time Question Tracking in Speech



Jakob D. Havtorn, Jan Latko, Joakim Edin, Lasse Borgholt, Lars Maaløe, Lorenzo Belgrano, Nicolai F. Jacobsen, Regitze Sdun, Željko Agić

* Accepted at ACL 2020 


On the Inductive Bias of Word-Character-Level Multi-Task Learning for Speech Recognition



Jan Kremer, Lasse Borgholt, Lars Maaløe

* Accepted at the IRASL workshop at NeurIPS 2018 

