Alert button
Picture for Roland Maas

Roland Maas

Alert button

Wav2vec-C: A Self-supervised Model for Speech Representation Learning

Add code
Bookmark button
Alert button
Mar 09, 2021
Samik Sadhu, Di He, Che-Wei Huang, Sri Harish Mallidi, Minhua Wu, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Roland Maas

Figure 1 for Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Figure 2 for Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Figure 3 for Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Figure 4 for Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Viaarxiv icon

REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling

Add code
Bookmark button
Alert button
Dec 14, 2020
Hu Hu, Xuesong Yang, Zeynab Raeesy, Jinxi Guo, Gokce Keskin, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Roland Maas

Figure 1 for REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling
Figure 2 for REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling
Viaarxiv icon

Efficient minimum word error rate training of RNN-Transducer for end-to-end speech recognition

Add code
Bookmark button
Alert button
Jul 27, 2020
Jinxi Guo, Gautam Tiwari, Jasha Droppo, Maarten Van Segbroeck, Che-Wei Huang, Andreas Stolcke, Roland Maas

Figure 1 for Efficient minimum word error rate training of RNN-Transducer for end-to-end speech recognition
Figure 2 for Efficient minimum word error rate training of RNN-Transducer for end-to-end speech recognition
Figure 3 for Efficient minimum word error rate training of RNN-Transducer for end-to-end speech recognition
Figure 4 for Efficient minimum word error rate training of RNN-Transducer for end-to-end speech recognition
Viaarxiv icon

Streaming End-to-End Bilingual ASR Systems with Joint Language Identification

Add code
Bookmark button
Alert button
Jul 08, 2020
Surabhi Punjabi, Harish Arsikere, Zeynab Raeesy, Chander Chandak, Nikhil Bhave, Ankish Bansal, Markus Müller, Sergio Murillo, Ariya Rastrow, Sri Garimella, Roland Maas, Mat Hans, Athanasios Mouchtaris, Siegfried Kunzmann

Figure 1 for Streaming End-to-End Bilingual ASR Systems with Joint Language Identification
Figure 2 for Streaming End-to-End Bilingual ASR Systems with Joint Language Identification
Figure 3 for Streaming End-to-End Bilingual ASR Systems with Joint Language Identification
Viaarxiv icon

Multi-view Frequency LSTM: An Efficient Frontend for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Jun 30, 2020
Maarten Van Segbroeck, Harish Mallidih, Brian King, I-Fan Chen, Gurpreet Chadha, Roland Maas

Viaarxiv icon

Streaming Language Identification using Combination of Acoustic Representations and ASR Hypotheses

Add code
Bookmark button
Alert button
Jun 01, 2020
Chander Chandak, Zeynab Raeesy, Ariya Rastrow, Yuzong Liu, Xiangyang Huang, Siyu Wang, Dong Kwon Joo, Roland Maas

Figure 1 for Streaming Language Identification using Combination of Acoustic Representations and ASR Hypotheses
Figure 2 for Streaming Language Identification using Combination of Acoustic Representations and ASR Hypotheses
Figure 3 for Streaming Language Identification using Combination of Acoustic Representations and ASR Hypotheses
Figure 4 for Streaming Language Identification using Combination of Acoustic Representations and ASR Hypotheses
Viaarxiv icon

DiPCo -- Dinner Party Corpus

Add code
Bookmark button
Alert button
Sep 30, 2019
Maarten Van Segbroeck, Ahmed Zaid, Ksenia Kutsenko, Cirenia Huerta, Tinh Nguyen, Xuewen Luo, Björn Hoffmeister, Jan Trmal, Maurizio Omologo, Roland Maas

Figure 1 for DiPCo -- Dinner Party Corpus
Figure 2 for DiPCo -- Dinner Party Corpus
Figure 3 for DiPCo -- Dinner Party Corpus
Figure 4 for DiPCo -- Dinner Party Corpus
Viaarxiv icon

Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning

Add code
Bookmark button
Alert button
Jan 11, 2019
Ladislav Mošner, Minhua Wu, Anirudh Raju, Sree Hari Krishnan Parthasarathi, Kenichi Kumatani, Shiva Sundaram, Roland Maas, Björn Hoffmeister

Figure 1 for Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning
Figure 2 for Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning
Figure 3 for Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning
Figure 4 for Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning
Viaarxiv icon

LSTM-based Whisper Detection

Add code
Bookmark button
Alert button
Sep 20, 2018
Zeynab Raeesy, Kellen Gillespie, Chengyuan Ma, Thomas Drugman, Jiacheng Gu, Roland Maas, Ariya Rastrow, Björn Hoffmeister

Figure 1 for LSTM-based Whisper Detection
Figure 2 for LSTM-based Whisper Detection
Figure 3 for LSTM-based Whisper Detection
Figure 4 for LSTM-based Whisper Detection
Viaarxiv icon

Device-directed Utterance Detection

Add code
Bookmark button
Alert button
Aug 07, 2018
Sri Harish Mallidi, Roland Maas, Kyle Goehner, Ariya Rastrow, Spyros Matsoukas, Björn Hoffmeister

Figure 1 for Device-directed Utterance Detection
Figure 2 for Device-directed Utterance Detection
Figure 3 for Device-directed Utterance Detection
Figure 4 for Device-directed Utterance Detection
Viaarxiv icon