"speech": models, code, and papers

Part-of-Speech Tagging of Odia Language Using statistical and Deep Learning-Based Approaches

Jul 07, 2022
Tusarkanta Dalai, Tapas Kumar Mishra, Pankaj K Sa

Subword Dictionary Learning and Segmentation Techniques for Automatic Speech Recognition in Tamil and Kannada

Jul 27, 2022
Madhavaraj A, Bharathi Pilar, Ramakrishnan A G

AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios

Apr 01, 2022
Yihan Wu, Xu Tan, Bohan Li, Lei He, Sheng Zhao, Ruihua Song, Tao Qin, Tie-Yan Liu

Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech Recognition

Feb 26, 2022
Mengzhe Geng, Xurong Xie, Zi Ye, Tianzi Wang, Guinan Li, Shujie Hu, Xunying Liu, Helen Meng

Single-channel speech enhancement by using psychoacoustical model inspired fusion framework

Feb 10, 2022
Suman Samui

Autoregressive Co-Training for Learning Discrete Speech Representations

Mar 29, 2022
Sung-Lin Yeh, Hao Tang

Exploring Attention Map Reuse for Efficient Transformer Neural Networks

Jan 29, 2023
Kyuhong Shim, Jungwook Choi, Wonyong Sung

Analysis of Joint Speech-Text Embeddings for Semantic Matching

Apr 04, 2022
Muhammad Huzaifah, Ivan Kukanov

Joint Far- and Near-End Speech Intelligibility Enhancement based on the Approximated Speech Intelligibility Index

Nov 15, 2021
Andreas Jonas Fuglsig, Jan Østergaard, Jesper Jensen, Lars Søndergaard Bertelsen, Peter Mariager, Zheng-Hua Tan

MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech

Oct 12, 2021
Szu-Wei Fu, Cheng Yu, Kuo-Hsuan Hung, Mirco Ravanelli, Yu Tsao
