Alert button

"speech": models, code, and papers
Alert button

DNSMOS P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors

Oct 08, 2021
Chandan K A Reddy, Vishak Gopal, Ross Cutler

Figure 1 for DNSMOS P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors
Figure 2 for DNSMOS P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors
Figure 3 for DNSMOS P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors
Figure 4 for DNSMOS P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors
Viaarxiv icon

Dictionary-Based Fusion of Contact and Acoustic Microphones for Wind Noise Reduction

May 18, 2022
Marvin Tammen, Xilin Li, Simon Doclo, Lalin Theverapperuma

Figure 1 for Dictionary-Based Fusion of Contact and Acoustic Microphones for Wind Noise Reduction
Figure 2 for Dictionary-Based Fusion of Contact and Acoustic Microphones for Wind Noise Reduction
Viaarxiv icon

SpeechNet: A Universal Modularized Model for Speech Processing Tasks

Add code
Bookmark button
Alert button
May 07, 2021
Yi-Chen Chen, Po-Han Chi, Shu-wen Yang, Kai-Wei Chang, Jheng-hao Lin, Sung-Feng Huang, Da-Rong Liu, Chi-Liang Liu, Cheng-Kuang Lee, Hung-yi Lee

Figure 1 for SpeechNet: A Universal Modularized Model for Speech Processing Tasks
Figure 2 for SpeechNet: A Universal Modularized Model for Speech Processing Tasks
Figure 3 for SpeechNet: A Universal Modularized Model for Speech Processing Tasks
Figure 4 for SpeechNet: A Universal Modularized Model for Speech Processing Tasks
Viaarxiv icon

Neural Network-augmented Kalman Filtering for Robust Online Speech Dereverberation in Noisy Reverberant Environments

Apr 06, 2022
Jean-Marie Lemercier, Joachim Thiemann, Raphael Koning, Timo Gerkmann

Figure 1 for Neural Network-augmented Kalman Filtering for Robust Online Speech Dereverberation in Noisy Reverberant Environments
Figure 2 for Neural Network-augmented Kalman Filtering for Robust Online Speech Dereverberation in Noisy Reverberant Environments
Viaarxiv icon

SoundChoice: Grapheme-to-Phoneme Models with Semantic Disambiguation

Add code
Bookmark button
Alert button
Jul 27, 2022
Artem Ploujnikov, Mirco Ravanelli

Figure 1 for SoundChoice: Grapheme-to-Phoneme Models with Semantic Disambiguation
Figure 2 for SoundChoice: Grapheme-to-Phoneme Models with Semantic Disambiguation
Figure 3 for SoundChoice: Grapheme-to-Phoneme Models with Semantic Disambiguation
Figure 4 for SoundChoice: Grapheme-to-Phoneme Models with Semantic Disambiguation
Viaarxiv icon

Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition

Jun 02, 2021
Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Takafumi Moriya, Naoyuki Kamo

Figure 1 for Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition
Figure 2 for Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition
Figure 3 for Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition
Viaarxiv icon

Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech

Aug 29, 2021
Injy Hamed, Pavel Denisov, Chia-Yu Li, Mohamed Elmahdy, Slim Abdennadher, Ngoc Thang Vu

Figure 1 for Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech
Figure 2 for Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech
Figure 3 for Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech
Figure 4 for Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech
Viaarxiv icon

Utterance-level neural confidence measure for end-to-end children speech recognition

Sep 16, 2021
Wei Liu, Tan Lee

Figure 1 for Utterance-level neural confidence measure for end-to-end children speech recognition
Figure 2 for Utterance-level neural confidence measure for end-to-end children speech recognition
Figure 3 for Utterance-level neural confidence measure for end-to-end children speech recognition
Figure 4 for Utterance-level neural confidence measure for end-to-end children speech recognition
Viaarxiv icon

Hierarchical Diffusion Models for Singing Voice Neural Vocoder

Add code
Bookmark button
Alert button
Oct 18, 2022
Naoya Takahashi, Mayank Kumar, Singh, Yuki Mitsufuji

Figure 1 for Hierarchical Diffusion Models for Singing Voice Neural Vocoder
Figure 2 for Hierarchical Diffusion Models for Singing Voice Neural Vocoder
Figure 3 for Hierarchical Diffusion Models for Singing Voice Neural Vocoder
Figure 4 for Hierarchical Diffusion Models for Singing Voice Neural Vocoder
Viaarxiv icon

Dual-Decoder Transformer For end-to-end Mandarin Chinese Speech Recognition with Pinyin and Character

Add code
Bookmark button
Alert button
Jan 26, 2022
Zhao Yang, Wei Xi, Rui Wang, Rui Jiang, Jizhong Zhao

Figure 1 for Dual-Decoder Transformer For end-to-end Mandarin Chinese Speech Recognition with Pinyin and Character
Figure 2 for Dual-Decoder Transformer For end-to-end Mandarin Chinese Speech Recognition with Pinyin and Character
Figure 3 for Dual-Decoder Transformer For end-to-end Mandarin Chinese Speech Recognition with Pinyin and Character
Figure 4 for Dual-Decoder Transformer For end-to-end Mandarin Chinese Speech Recognition with Pinyin and Character
Viaarxiv icon