"speech recognition": models, code, and papers

Multi-objective Recurrent Neural Networks Optimization for the Edge -- a Quantization-based Approach

Aug 02, 2021
Nesma M. Rezk, Tomas Nordström, Dimitrios Stathis, Zain Ul-Abdin, Eren Erdal Aksoy, Ahmed Hemani

Via arXiv

Decoupling Pronunciation and Language for End-to-end Code-switching Automatic Speech Recognition

Oct 28, 2020
Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Ye Bai, Jianhua Tao, Zhengqi Wen

Via arXiv

Speech2Slot: An End-to-End Knowledge-based Slot Filling from Speech

May 10, 2021
Pengwei Wang, Xin Ye, Xiaohuan Zhou, Jinghui Xie, Hao Wang

Via arXiv

Bridging the Gap Between Clean Data Training and Real-World Inference for Spoken Language Understanding

Apr 13, 2021
Di Wu, Yiren Chen, Liang Ding, Dacheng Tao

Via arXiv

Experiments of ASR-based mispronunciation detection for children and adult English learners

Apr 13, 2021
Nina Hosseini-Kivanani, Roberto Gretter, Marco Matassoni, Giuseppe Daniele Falavigna

Via arXiv

Lexical Access Model for Italian -- Modeling human speech processing: identification of words in running speech toward lexical access based on the detection of landmarks and other acoustic cues to features

Jun 24, 2021
Maria-Gabriella Di Benedetto, Stefanie Shattuck-Hufnagel, Jeung-Yoon Choi, Luca De Nardis, Javier Arango, Ian Chan, Alec DeCaprio

Via arXiv

Knowledge Distillation for Improved Accuracy in Spoken Question Answering

Oct 21, 2020
Chenyu You, Nuo Chen, Yuexian Zou

Via arXiv

Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems

Apr 08, 2021
Juan Zuluaga-Gomez, Iuliia Nigmatulina, Amrutha Prasad, Petr Motlicek, Karel Veselý, Martin Kocour, Igor Szöke

Via arXiv

Layer Reduction: Accelerating Conformer-Based Self-Supervised Model via Layer Consistency

Apr 08, 2021
Jinchuan Tian, Rongzhi Gu, Helin Wang, Yuexian Zou

Via arXiv

An Ultra-low Power RNN Classifier for Always-On Voice Wake-Up Detection Robust to Real-World Scenarios

Mar 08, 2021
Emmanuel Hardy, Franck Badets

Via arXiv