"speech": models, code, and papers

Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy

Oct 20, 2022
Sarina Meyer, Pascal Tilli, Pavel Denisov, Florian Lux, Julia Koch, Ngoc Thang Vu

An Adapter based Multi-label Pre-training for Speech Separation and Enhancement

Nov 11, 2022
Tianrui Wang, Xie Chen, Zhuo Chen, Shu Yu, Weibin Zhu

Cross-Modal Mutual Learning for Cued Speech Recognition

Dec 02, 2022
Lei Liu, Li Liu

Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses

Nov 29, 2022
Yang Ai, Zhen-Hua Ling

Deep Speech Synthesis from Articulatory Representations

Sep 13, 2022
Peter Wu, Shinji Watanabe, Louis Goldstein, Alan W Black, Gopala K. Anumanchipalli

EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance

Nov 17, 2022
Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu

Efficiently Trained Mongolian Text-to-Speech System Based On FullConv

Oct 24, 2022
ZiQi Liang

E2E Spoken Entity Extraction for Virtual Agents

Mar 01, 2023
Karan Singla, Yeon-Jun Kim, Ryan Price, Shahab Jalalvand, Srinivas Bangalore

Combining Automatic Speaker Verification and Prosody Analysis for Synthetic Speech Detection

Oct 31, 2022
Luigi Attorresi, Davide Salvi, Clara Borrelli, Paolo Bestagini, Stefano Tubaro

Source Free Domain Adaptation of a DNN for SSVEP-based Brain-Computer Interfaces

May 27, 2023
Osman Berke Guney, Deniz Kucukahmetler, Huseyin Ozkan
