Alert button

"speech recognition": models, code, and papers
Alert button

SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech

Add code
Bookmark button
Alert button
Nov 19, 2021
Suwon Shon, Ankita Pasad, Felix Wu, Pablo Brusco, Yoav Artzi, Karen Livescu, Kyu J. Han

Figure 1 for SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech
Figure 2 for SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech
Figure 3 for SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech
Figure 4 for SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech
Viaarxiv icon

A Conformer-based ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement and Speech Separation

Nov 18, 2021
Tom O'Malley, Arun Narayanan, Quan Wang, Alex Park, James Walker, Nathan Howard

Figure 1 for A Conformer-based ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement and Speech Separation
Figure 2 for A Conformer-based ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement and Speech Separation
Figure 3 for A Conformer-based ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement and Speech Separation
Figure 4 for A Conformer-based ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement and Speech Separation
Viaarxiv icon

ASL Trigger Recognition in Mixed Activity/Signing Sequences for RF Sensor-Based User Interfaces

Add code
Bookmark button
Alert button
Nov 18, 2021
Emre Kurtoglu, Ali C. Gurbuz, Evie A. Malaia, Darrin Griffin, Chris Crawford, Sevgi Z. Gurbuz

Figure 1 for ASL Trigger Recognition in Mixed Activity/Signing Sequences for RF Sensor-Based User Interfaces
Figure 2 for ASL Trigger Recognition in Mixed Activity/Signing Sequences for RF Sensor-Based User Interfaces
Figure 3 for ASL Trigger Recognition in Mixed Activity/Signing Sequences for RF Sensor-Based User Interfaces
Figure 4 for ASL Trigger Recognition in Mixed Activity/Signing Sequences for RF Sensor-Based User Interfaces
Viaarxiv icon

Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition

Sep 17, 2021
Felix Weninger, Marco Gaudesi, Ralf Leibold, Roberto Gemello, Puming Zhan

Figure 1 for Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition
Figure 2 for Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition
Figure 3 for Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition
Figure 4 for Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition
Viaarxiv icon

Speech Command Recognition in Computationally Constrained Environments with a Quadratic Self-organized Operational Layer

Nov 23, 2020
Mohammad Soltanian, Junaid Malik, Jenni Raitoharju, Alexandros Iosifidis, Serkan Kiranyaz, Moncef Gabbouj

Figure 1 for Speech Command Recognition in Computationally Constrained Environments with a Quadratic Self-organized Operational Layer
Figure 2 for Speech Command Recognition in Computationally Constrained Environments with a Quadratic Self-organized Operational Layer
Figure 3 for Speech Command Recognition in Computationally Constrained Environments with a Quadratic Self-organized Operational Layer
Figure 4 for Speech Command Recognition in Computationally Constrained Environments with a Quadratic Self-organized Operational Layer
Viaarxiv icon

Reducing Exposure Bias in Training Recurrent Neural Network Transducers

Aug 24, 2021
Xiaodong Cui, Brian Kingsbury, George Saon, David Haws, Zoltan Tuske

Figure 1 for Reducing Exposure Bias in Training Recurrent Neural Network Transducers
Figure 2 for Reducing Exposure Bias in Training Recurrent Neural Network Transducers
Figure 3 for Reducing Exposure Bias in Training Recurrent Neural Network Transducers
Figure 4 for Reducing Exposure Bias in Training Recurrent Neural Network Transducers
Viaarxiv icon

Joint Unsupervised and Supervised Training for Multilingual ASR

Nov 15, 2021
Junwen Bai, Bo Li, Yu Zhang, Ankur Bapna, Nikhil Siddhartha, Khe Chai Sim, Tara N. Sainath

Figure 1 for Joint Unsupervised and Supervised Training for Multilingual ASR
Figure 2 for Joint Unsupervised and Supervised Training for Multilingual ASR
Figure 3 for Joint Unsupervised and Supervised Training for Multilingual ASR
Figure 4 for Joint Unsupervised and Supervised Training for Multilingual ASR
Viaarxiv icon

Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement

Add code
Bookmark button
Alert button
Dec 21, 2021
Yichao Du, Zhirui Zhang, Weizhi Wang, Boxing Chen, Jun Xie, Tong Xu

Figure 1 for Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement
Figure 2 for Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement
Figure 3 for Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement
Figure 4 for Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement
Viaarxiv icon

Findings from Experiments of On-line Joint Reinforcement Learning of Semantic Parser and Dialogue Manager with real Users

Oct 25, 2021
Matthieu Riou, Bassam Jabaian, Stéphane Huet, Fabrice Lefèvre

Figure 1 for Findings from Experiments of On-line Joint Reinforcement Learning of Semantic Parser and Dialogue Manager with real Users
Figure 2 for Findings from Experiments of On-line Joint Reinforcement Learning of Semantic Parser and Dialogue Manager with real Users
Figure 3 for Findings from Experiments of On-line Joint Reinforcement Learning of Semantic Parser and Dialogue Manager with real Users
Figure 4 for Findings from Experiments of On-line Joint Reinforcement Learning of Semantic Parser and Dialogue Manager with real Users
Viaarxiv icon

Assessing the Use of Prosody in Constituency Parsing of Imperfect Transcripts

Add code
Bookmark button
Alert button
Jun 14, 2021
Trang Tran, Mari Ostendorf

Figure 1 for Assessing the Use of Prosody in Constituency Parsing of Imperfect Transcripts
Figure 2 for Assessing the Use of Prosody in Constituency Parsing of Imperfect Transcripts
Figure 3 for Assessing the Use of Prosody in Constituency Parsing of Imperfect Transcripts
Figure 4 for Assessing the Use of Prosody in Constituency Parsing of Imperfect Transcripts
Viaarxiv icon