Alert button

"speech": models, code, and papers
Alert button

Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism

Jul 02, 2022
Kun Wei, Pengcheng Guo, Ning Jiang

Figure 1 for Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism
Figure 2 for Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism
Figure 3 for Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism
Figure 4 for Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism
Viaarxiv icon

SUPERB: Speech processing Universal PERformance Benchmark

Add code
Bookmark button
Alert button
May 03, 2021
Shu-wen Yang, Po-Han Chi, Yung-Sung Chuang, Cheng-I Jeff Lai, Kushal Lakhotia, Yist Y. Lin, Andy T. Liu, Jiatong Shi, Xuankai Chang, Guan-Ting Lin, Tzu-Hsien Huang, Wei-Cheng Tseng, Ko-tik Lee, Da-Rong Liu, Zili Huang, Shuyan Dong, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee

Figure 1 for SUPERB: Speech processing Universal PERformance Benchmark
Figure 2 for SUPERB: Speech processing Universal PERformance Benchmark
Viaarxiv icon

Quantifying Bias in Automatic Speech Recognition

Add code
Bookmark button
Alert button
Apr 01, 2021
Siyuan Feng, Olya Kudina, Bence Mark Halpern, Odette Scharenborg

Figure 1 for Quantifying Bias in Automatic Speech Recognition
Figure 2 for Quantifying Bias in Automatic Speech Recognition
Figure 3 for Quantifying Bias in Automatic Speech Recognition
Figure 4 for Quantifying Bias in Automatic Speech Recognition
Viaarxiv icon

BirdSoundsDenoising: Deep Visual Audio Denoising for Bird Sounds

Add code
Bookmark button
Alert button
Oct 18, 2022
Youshan Zhang, Jialu Li

Figure 1 for BirdSoundsDenoising: Deep Visual Audio Denoising for Bird Sounds
Figure 2 for BirdSoundsDenoising: Deep Visual Audio Denoising for Bird Sounds
Figure 3 for BirdSoundsDenoising: Deep Visual Audio Denoising for Bird Sounds
Figure 4 for BirdSoundsDenoising: Deep Visual Audio Denoising for Bird Sounds
Viaarxiv icon

A Study into Pre-training Strategies for Spoken Language Understanding on Dysarthric Speech

Jun 15, 2021
Pu Wang, Bagher BabaAli, Hugo Van hamme

Figure 1 for A Study into Pre-training Strategies for Spoken Language Understanding on Dysarthric Speech
Figure 2 for A Study into Pre-training Strategies for Spoken Language Understanding on Dysarthric Speech
Figure 3 for A Study into Pre-training Strategies for Spoken Language Understanding on Dysarthric Speech
Figure 4 for A Study into Pre-training Strategies for Spoken Language Understanding on Dysarthric Speech
Viaarxiv icon

Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech

Jan 21, 2021
Takuya Fujimura, Yuma Koizumi, Kohei Yatabe, Ryoichi Miyazaki

Figure 1 for Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech
Figure 2 for Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech
Figure 3 for Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech
Figure 4 for Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech
Viaarxiv icon

AECMOS: A speech quality assessment metric for echo impairment

Add code
Bookmark button
Alert button
Oct 08, 2021
Marju Purin, Sten Sootla, Mateja Sponza, Ando Saabas, Ross Cutler

Figure 1 for AECMOS: A speech quality assessment metric for echo impairment
Figure 2 for AECMOS: A speech quality assessment metric for echo impairment
Figure 3 for AECMOS: A speech quality assessment metric for echo impairment
Figure 4 for AECMOS: A speech quality assessment metric for echo impairment
Viaarxiv icon

UWSpeech: Speech to Speech Translation for Unwritten Languages

Add code
Bookmark button
Alert button
Jun 14, 2020
Chen Zhang, Xu Tan, Yi Ren, Tao Qin, Kejun Zhang, Tie-Yan Liu

Figure 1 for UWSpeech: Speech to Speech Translation for Unwritten Languages
Figure 2 for UWSpeech: Speech to Speech Translation for Unwritten Languages
Figure 3 for UWSpeech: Speech to Speech Translation for Unwritten Languages
Figure 4 for UWSpeech: Speech to Speech Translation for Unwritten Languages
Viaarxiv icon

Is Attention always needed? A Case Study on Language Identification from Speech

Oct 05, 2021
Atanu Mandal, Santanu Pal, Indranil Dutta, Mahidas Bhattacharya, Sudip Kumar Naskar

Figure 1 for Is Attention always needed? A Case Study on Language Identification from Speech
Figure 2 for Is Attention always needed? A Case Study on Language Identification from Speech
Figure 3 for Is Attention always needed? A Case Study on Language Identification from Speech
Figure 4 for Is Attention always needed? A Case Study on Language Identification from Speech
Viaarxiv icon

Enhancing audio quality for expressive Neural Text-to-Speech

Aug 13, 2021
Abdelhamid Ezzerg, Adam Gabrys, Bartosz Putrycz, Daniel Korzekwa, Daniel Saez-Trigueros, David McHardy, Kamil Pokora, Jakub Lachowicz, Jaime Lorenzo-Trueba, Viacheslav Klimkov

Figure 1 for Enhancing audio quality for expressive Neural Text-to-Speech
Figure 2 for Enhancing audio quality for expressive Neural Text-to-Speech
Figure 3 for Enhancing audio quality for expressive Neural Text-to-Speech
Figure 4 for Enhancing audio quality for expressive Neural Text-to-Speech
Viaarxiv icon