
"speech": models, code, and papers

Deep Learning Based Assessment of Synthetic Speech Naturalness

Apr 23, 2021
Gabriel Mittag, Sebastian Möller


Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings

Jun 08, 2021
Marcely Zanon Boito, Bolaji Yusuf, Lucas Ondel, Aline Villavicencio, Laurent Besacier


GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis

Jun 29, 2021
Jinhyeok Yang, Jae-Sung Bae, Taejun Bak, Youngik Kim, Hoon-Young Cho


Speech Resynthesis from Discrete Disentangled Self-Supervised Representations

Apr 02, 2021
Adam Polyak, Yossi Adi, Jade Copet, Eugene Kharitonov, Kushal Lakhotia, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux


DeepFilterNet: A Low Complexity Speech Enhancement Framework for Full-Band Audio based on Deep Filtering

Oct 11, 2021
Hendrik Schröter, Alberto N. Escalante-B., Tobias Rosenkranz, Andreas Maier


SEMOUR: A Scripted Emotional Speech Repository for Urdu

May 19, 2021
Nimra Zaheer, Obaid Ullah Ahmad, Ammar Ahmed, Muhammad Shehryar Khan, Mudassir Shabbir


Predicting speech intelligibility from EEG using a dilated convolutional network

May 19, 2021
Bernd Accou, Mohammad Jalilpour Monesi, Hugo Van hamme, Tom Francart


BiFSMNv2: Pushing Binary Neural Networks for Keyword Spotting to Real-Network Performance

Nov 13, 2022
Haotong Qin, Xudong Ma, Yifu Ding, Xiaoyang Li, Yang Zhang, Zejun Ma, Jiakai Wang, Jie Luo, Xianglong Liu


A Review on Part-of-Speech Technologies

Oct 11, 2021
Onyenwe Ikechukwu, Onyedikachukwu Ikechukwu-Onyenwe, Onyedinma Ebele


Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models

Mar 30, 2022
Vineet Garg, Ognjen Rudovic, Pranay Dighe, Ahmed H. Abdelaziz, Erik Marchi, Saurabh Adya, Chandra Dhir, Ahmed Tewfik
