Alert button

"speech recognition": models, code, and papers
Alert button

hierarchical network with decoupled knowledge distillation for speech emotion recognition

Mar 09, 2023
Ziping Zhao, Huan Wang, Haishuai Wang, Bjorn Schuller

Figure 1 for hierarchical network with decoupled knowledge distillation for speech emotion recognition
Figure 2 for hierarchical network with decoupled knowledge distillation for speech emotion recognition
Figure 3 for hierarchical network with decoupled knowledge distillation for speech emotion recognition
Figure 4 for hierarchical network with decoupled knowledge distillation for speech emotion recognition
Viaarxiv icon

Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification

Sep 13, 2022
Chao Zhang, Bo Li, Tara Sainath, Trevor Strohman, Sepand Mavandadi, Shuo-yiin Chang, Parisa Haghani

Figure 1 for Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification
Figure 2 for Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification
Figure 3 for Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification
Figure 4 for Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification
Viaarxiv icon

Bangla-Wave: Improving Bangla Automatic Speech Recognition Utilizing N-gram Language Models

Sep 13, 2022
Mohammed Rakib, Md. Ismail Hossain, Nabeel Mohammed, Fuad Rahman

Figure 1 for Bangla-Wave: Improving Bangla Automatic Speech Recognition Utilizing N-gram Language Models
Figure 2 for Bangla-Wave: Improving Bangla Automatic Speech Recognition Utilizing N-gram Language Models
Figure 3 for Bangla-Wave: Improving Bangla Automatic Speech Recognition Utilizing N-gram Language Models
Viaarxiv icon

Interactive spatial speech recognition maps based on simulated speech recognition experiments

Apr 01, 2021
Marc René Schädler

Figure 1 for Interactive spatial speech recognition maps based on simulated speech recognition experiments
Figure 2 for Interactive spatial speech recognition maps based on simulated speech recognition experiments
Figure 3 for Interactive spatial speech recognition maps based on simulated speech recognition experiments
Figure 4 for Interactive spatial speech recognition maps based on simulated speech recognition experiments
Viaarxiv icon

Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents

Apr 03, 2022
Priyank Dubey, Bilal Shah

Viaarxiv icon

Who Needs Decoders? Efficient Estimation of Sequence-level Attributes

May 09, 2023
Yassir Fathullah, Puria Radmard, Adian Liusie, Mark J. F. Gales

Figure 1 for Who Needs Decoders? Efficient Estimation of Sequence-level Attributes
Figure 2 for Who Needs Decoders? Efficient Estimation of Sequence-level Attributes
Figure 3 for Who Needs Decoders? Efficient Estimation of Sequence-level Attributes
Figure 4 for Who Needs Decoders? Efficient Estimation of Sequence-level Attributes
Viaarxiv icon

Heterogeneous Reservoir Computing Models for Persian Speech Recognition

May 25, 2022
Zohreh Ansari, Farzin Pourhoseini, Fatemeh Hadaeghi

Figure 1 for Heterogeneous Reservoir Computing Models for Persian Speech Recognition
Figure 2 for Heterogeneous Reservoir Computing Models for Persian Speech Recognition
Figure 3 for Heterogeneous Reservoir Computing Models for Persian Speech Recognition
Figure 4 for Heterogeneous Reservoir Computing Models for Persian Speech Recognition
Viaarxiv icon

Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations

May 18, 2023
Weiwei Lin, Chenhang He, Man-Wai Mak, Youzhi Tu

Figure 1 for Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations
Figure 2 for Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations
Figure 3 for Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations
Figure 4 for Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations
Viaarxiv icon

A Lexical-aware Non-autoregressive Transformer-based ASR Model

May 18, 2023
Chong-En Lin, Kuan-Yu Chen

Figure 1 for A Lexical-aware Non-autoregressive Transformer-based ASR Model
Figure 2 for A Lexical-aware Non-autoregressive Transformer-based ASR Model
Figure 3 for A Lexical-aware Non-autoregressive Transformer-based ASR Model
Viaarxiv icon

Multi-Level Modeling Units for End-to-End Mandarin Speech Recognition

May 25, 2022
Yuting Yang, Binbin Du, Yuke Li

Figure 1 for Multi-Level Modeling Units for End-to-End Mandarin Speech Recognition
Figure 2 for Multi-Level Modeling Units for End-to-End Mandarin Speech Recognition
Figure 3 for Multi-Level Modeling Units for End-to-End Mandarin Speech Recognition
Figure 4 for Multi-Level Modeling Units for End-to-End Mandarin Speech Recognition
Viaarxiv icon