"speech recognition": models, code, and papers

Challenges and Opportunities of Speech Recognition for Bengali Language

Sep 27, 2021
M. F. Mridha, Abu Quwsar Ohi, Md. Abdul Hamid, Muhammad Mostafa Monowar

Via arXiv

Evil Operation: Breaking Speaker Recognition with PaddingBack

Aug 08, 2023
Zhe Ye, Diqun Yan, Li Dong, Kailai Shen

Via arXiv

Bayesian Neural Network Language Modeling for Speech Recognition

Aug 28, 2022
Boyang Xue, Shoukang Hu, Junhao Xu, Mengzhe Geng, Xunying Liu, Helen Meng

Via arXiv

Analysis of EEG frequency bands for Envisioned Speech Recognition

Mar 29, 2022
Ayush Tripathi

Via arXiv

SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge?

Jun 14, 2023
Takanori Ashihara, Takafumi Moriya, Kohei Matsuura, Tomohiro Tanaka, Yusuke Ijima, Taichi Asami, Marc Delcroix, Yukinori Honma

Via arXiv

Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks

May 04, 2023
Yun Tang, Anna Y. Sun, Hirofumi Inaguma, Xinyue Chen, Ning Dong, Xutai Ma, Paden D. Tomasello, Juan Pino

Via arXiv

Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition

Jul 29, 2022
Peng Shen, Xugang Lu, Hisashi Kawai

Via arXiv

End-to-end Speech-to-Punctuated-Text Recognition

Jul 07, 2022
Jumon Nozaki, Tatsuya Kawahara, Kenkichi Ishizuka, Taiichi Hashimoto

Via arXiv

On the Impact of Speech Recognition Errors in Passage Retrieval for Spoken Question Answering

Sep 26, 2022
Georgios Sidiropoulos, Svitlana Vakulenko, Evangelos Kanoulas

Via arXiv

Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations

May 18, 2023
Weiwei Lin, Chenhang He, Man-Wai Mak, Youzhi Tu

Via arXiv