Alert button

"speech": models, code, and papers
Alert button

Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation

Add code
Bookmark button
Alert button
Feb 03, 2021
Mingke Xu, Fan Zhang, Xiaodong Cui, Wei Zhang

Figure 1 for Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation
Figure 2 for Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation
Figure 3 for Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation
Figure 4 for Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation
Viaarxiv icon

Hopeful_Men@LT-EDI-EACL2021: Hope Speech Detection Using Indic Transliteration and Transformers

Add code
Bookmark button
Alert button
Feb 24, 2021
Ishan Sanjeev Upadhyay, Nikhil E, Anshul Wadhawan, Radhika Mamidi

Figure 1 for Hopeful_Men@LT-EDI-EACL2021: Hope Speech Detection Using Indic Transliteration and Transformers
Figure 2 for Hopeful_Men@LT-EDI-EACL2021: Hope Speech Detection Using Indic Transliteration and Transformers
Figure 3 for Hopeful_Men@LT-EDI-EACL2021: Hope Speech Detection Using Indic Transliteration and Transformers
Figure 4 for Hopeful_Men@LT-EDI-EACL2021: Hope Speech Detection Using Indic Transliteration and Transformers
Viaarxiv icon

WaveCycleGAN2: Time-domain Neural Post-filter for Speech Waveform Generation

Apr 09, 2019
Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Nobukatsu Hojo

Figure 1 for WaveCycleGAN2: Time-domain Neural Post-filter for Speech Waveform Generation
Figure 2 for WaveCycleGAN2: Time-domain Neural Post-filter for Speech Waveform Generation
Figure 3 for WaveCycleGAN2: Time-domain Neural Post-filter for Speech Waveform Generation
Figure 4 for WaveCycleGAN2: Time-domain Neural Post-filter for Speech Waveform Generation
Viaarxiv icon

Phonetically-Oriented Word Error Alignment for Speech Recognition Error Analysis in Speech Translation

Add code
Bookmark button
Alert button
Apr 24, 2019
Nicholas Ruiz, Marcello Federico

Figure 1 for Phonetically-Oriented Word Error Alignment for Speech Recognition Error Analysis in Speech Translation
Figure 2 for Phonetically-Oriented Word Error Alignment for Speech Recognition Error Analysis in Speech Translation
Figure 3 for Phonetically-Oriented Word Error Alignment for Speech Recognition Error Analysis in Speech Translation
Figure 4 for Phonetically-Oriented Word Error Alignment for Speech Recognition Error Analysis in Speech Translation
Viaarxiv icon

Speaker independence of neural vocoders and their effect on parametric resynthesis speech enhancement

Add code
Bookmark button
Alert button
Nov 14, 2019
Soumi Maiti, Michael I Mandel

Figure 1 for Speaker independence of neural vocoders and their effect on parametric resynthesis speech enhancement
Figure 2 for Speaker independence of neural vocoders and their effect on parametric resynthesis speech enhancement
Figure 3 for Speaker independence of neural vocoders and their effect on parametric resynthesis speech enhancement
Figure 4 for Speaker independence of neural vocoders and their effect on parametric resynthesis speech enhancement
Viaarxiv icon

Training Efficient CNNS: Tweaking the Nuts and Bolts of Neural Networks for Lighter, Faster and Robust Models

May 23, 2022
Sabeesh Ethiraj, Bharath Kumar Bolla

Figure 1 for Training Efficient CNNS: Tweaking the Nuts and Bolts of Neural Networks for Lighter, Faster and Robust Models
Figure 2 for Training Efficient CNNS: Tweaking the Nuts and Bolts of Neural Networks for Lighter, Faster and Robust Models
Figure 3 for Training Efficient CNNS: Tweaking the Nuts and Bolts of Neural Networks for Lighter, Faster and Robust Models
Figure 4 for Training Efficient CNNS: Tweaking the Nuts and Bolts of Neural Networks for Lighter, Faster and Robust Models
Viaarxiv icon

Residual Energy-Based Models for End-to-End Speech Recognition

Mar 25, 2021
Qiujia Li, Yu Zhang, Bo Li, Liangliang Cao, Philip C. Woodland

Figure 1 for Residual Energy-Based Models for End-to-End Speech Recognition
Figure 2 for Residual Energy-Based Models for End-to-End Speech Recognition
Figure 3 for Residual Energy-Based Models for End-to-End Speech Recognition
Figure 4 for Residual Energy-Based Models for End-to-End Speech Recognition
Viaarxiv icon

Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge

May 24, 2020
Andros Tjandra, Sakriani Sakti, Satoshi Nakamura

Figure 1 for Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge
Figure 2 for Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge
Figure 3 for Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge
Figure 4 for Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge
Viaarxiv icon

Detecting Unintended Memorization in Language-Model-Fused ASR

Apr 20, 2022
W. Ronny Huang, Steve Chien, Om Thakkar, Rajiv Mathews

Figure 1 for Detecting Unintended Memorization in Language-Model-Fused ASR
Figure 2 for Detecting Unintended Memorization in Language-Model-Fused ASR
Figure 3 for Detecting Unintended Memorization in Language-Model-Fused ASR
Figure 4 for Detecting Unintended Memorization in Language-Model-Fused ASR
Viaarxiv icon

The ASRU 2019 Mandarin-English Code-Switching Speech Recognition Challenge: Open Datasets, Tracks, Methods and Results

Jul 12, 2020
Xian Shi, Qiangze Feng, Lei Xie

Figure 1 for The ASRU 2019 Mandarin-English Code-Switching Speech Recognition Challenge: Open Datasets, Tracks, Methods and Results
Figure 2 for The ASRU 2019 Mandarin-English Code-Switching Speech Recognition Challenge: Open Datasets, Tracks, Methods and Results
Figure 3 for The ASRU 2019 Mandarin-English Code-Switching Speech Recognition Challenge: Open Datasets, Tracks, Methods and Results
Figure 4 for The ASRU 2019 Mandarin-English Code-Switching Speech Recognition Challenge: Open Datasets, Tracks, Methods and Results
Viaarxiv icon