Alert button

"speech recognition": models, code, and papers
Alert button

Multi-Dialect Arabic Speech Recognition

Dec 25, 2021
Abbas Raza Ali

Figure 1 for Multi-Dialect Arabic Speech Recognition
Figure 2 for Multi-Dialect Arabic Speech Recognition
Figure 3 for Multi-Dialect Arabic Speech Recognition
Figure 4 for Multi-Dialect Arabic Speech Recognition
Viaarxiv icon

Korean Tokenization for Beam Search Rescoring in Speech Recognition

Mar 28, 2022
Kyuhong Shim, Hyewon Bae, Wonyong Sung

Figure 1 for Korean Tokenization for Beam Search Rescoring in Speech Recognition
Figure 2 for Korean Tokenization for Beam Search Rescoring in Speech Recognition
Figure 3 for Korean Tokenization for Beam Search Rescoring in Speech Recognition
Figure 4 for Korean Tokenization for Beam Search Rescoring in Speech Recognition
Viaarxiv icon

Adaptive multilingual speech recognition with pretrained models

May 24, 2022
Ngoc-Quan Pham, Alex Waibel, Jan Niehues

Figure 1 for Adaptive multilingual speech recognition with pretrained models
Viaarxiv icon

A Comparative Study on multichannel Speaker-attributed automatic speech recognition in Multi-party Meetings

Nov 01, 2022
Mohan Shi, Jie Zhang, Zhihao Du, Fan Yu, Shiliang Zhang, Li-Rong Dai

Figure 1 for A Comparative Study on multichannel Speaker-attributed automatic speech recognition in Multi-party Meetings
Figure 2 for A Comparative Study on multichannel Speaker-attributed automatic speech recognition in Multi-party Meetings
Figure 3 for A Comparative Study on multichannel Speaker-attributed automatic speech recognition in Multi-party Meetings
Figure 4 for A Comparative Study on multichannel Speaker-attributed automatic speech recognition in Multi-party Meetings
Viaarxiv icon

Evaluation of Speaker Anonymization on Emotional Speech

Apr 15, 2023
Hubert Nourtel, Pierre Champion, Denis Jouvet, Anthony Larcher, Marie Tahon

Figure 1 for Evaluation of Speaker Anonymization on Emotional Speech
Figure 2 for Evaluation of Speaker Anonymization on Emotional Speech
Figure 3 for Evaluation of Speaker Anonymization on Emotional Speech
Figure 4 for Evaluation of Speaker Anonymization on Emotional Speech
Viaarxiv icon

Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey

Feb 22, 2022
Ngoc Dung Huynh, Mohamed Reda Bouadjenek, Imran Razzak, Kevin Lee, Chetan Arora, Ali Hassani, Arkady Zaslavsky

Figure 1 for Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey
Figure 2 for Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey
Figure 3 for Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey
Figure 4 for Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey
Viaarxiv icon

Multilingual Speech Emotion Recognition With Multi-Gating Mechanism and Neural Architecture Search

Nov 16, 2022
Zihan Wang, Qi Meng, HaiFeng Lan, XinRui Zhang, KeHao Guo, Akshat Gupta

Figure 1 for Multilingual Speech Emotion Recognition With Multi-Gating Mechanism and Neural Architecture Search
Figure 2 for Multilingual Speech Emotion Recognition With Multi-Gating Mechanism and Neural Architecture Search
Figure 3 for Multilingual Speech Emotion Recognition With Multi-Gating Mechanism and Neural Architecture Search
Figure 4 for Multilingual Speech Emotion Recognition With Multi-Gating Mechanism and Neural Architecture Search
Viaarxiv icon

Similarity and Content-based Phonetic Self Attention for Speech Recognition

Mar 28, 2022
Kyuhong Shim, Wonyong Sung

Figure 1 for Similarity and Content-based Phonetic Self Attention for Speech Recognition
Figure 2 for Similarity and Content-based Phonetic Self Attention for Speech Recognition
Figure 3 for Similarity and Content-based Phonetic Self Attention for Speech Recognition
Figure 4 for Similarity and Content-based Phonetic Self Attention for Speech Recognition
Viaarxiv icon

Persistent Laplacian-enhanced Algorithm for Scarcely Labeled Data Classification

May 25, 2023
Gokul Bhusal, Ekaterina Merkurjev, Guo-Wei Wei

Figure 1 for Persistent Laplacian-enhanced Algorithm for Scarcely Labeled Data Classification
Figure 2 for Persistent Laplacian-enhanced Algorithm for Scarcely Labeled Data Classification
Figure 3 for Persistent Laplacian-enhanced Algorithm for Scarcely Labeled Data Classification
Figure 4 for Persistent Laplacian-enhanced Algorithm for Scarcely Labeled Data Classification
Viaarxiv icon

Robust Self-Supervised Audio-Visual Speech Recognition

Jan 05, 2022
Bowen Shi, Wei-Ning Hsu, Abdelrahman Mohamed

Figure 1 for Robust Self-Supervised Audio-Visual Speech Recognition
Figure 2 for Robust Self-Supervised Audio-Visual Speech Recognition
Figure 3 for Robust Self-Supervised Audio-Visual Speech Recognition
Figure 4 for Robust Self-Supervised Audio-Visual Speech Recognition
Viaarxiv icon