"speech": models, code, and papers

A review of discourse and conversation impairments in patients with dementia

Nov 15, 2022
Charalambos Themistocleous

Via arXiv

BDMMT: Backdoor Sample Detection for Language Models through Model Mutation Testing

Jan 25, 2023
Jiali Wei, Ming Fan, Wenjing Jiao, Wuxia Jin, Ting Liu

Via arXiv

Non-Parametric Domain Adaptation for End-to-End Speech Translation

May 27, 2022
Yichao Du, Weizhi Wang, Zhirui Zhang, Boxing Chen, Tong Xu, Jun Xie, Enhong Chen

Via arXiv

ProsoSpeech: Enhancing Prosody With Quantized Vector Pre-training in Text-to-Speech

Feb 16, 2022
Yi Ren, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen, Zhijie Yan, Zhou Zhao

Via arXiv

Unsupervised word-level prosody tagging for controllable speech synthesis

Feb 16, 2022
Yiwei Guo, Chenpeng Du, Kai Yu

Via arXiv

Towards Expressive Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis

Apr 06, 2022
Shun Lei, Yixuan Zhou, Liyang Chen, Zhiyong Wu, Shiyin Kang, Helen Meng

Via arXiv

Towards visually prompted keyword localisation for zero-resource spoken languages

Oct 12, 2022
Leanne Nortje, Herman Kamper

Via arXiv

Low-latency Monaural Speech Enhancement with Deep Filter-bank Equalizer

Feb 14, 2022
Chengshi Zheng, Wenzhe Liu, Andong Li, Yuxuan Ke, Xiaodong Li

Via arXiv

Cognitive Coding of Speech

Oct 08, 2021
Reza Lotfidereshgi, Philippe Gournay

Via arXiv

MixupE: Understanding and Improving Mixup from Directional Derivative Perspective

Dec 29, 2022
Vikas Verma, Sarthak Mittal, Wai Hoh Tang, Hieu Pham, Juho Kannala, Yoshua Bengio, Arno Solin, Kenji Kawaguchi

Via arXiv