Alert button

"speech": models, code, and papers
Alert button

On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis

Add code
Bookmark button
Alert button
Jul 11, 2023
Siyang Wang, Gustav Eje Henter, Joakim Gustafson, Éva Székely

Figure 1 for On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis
Figure 2 for On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis
Figure 3 for On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis
Figure 4 for On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis
Viaarxiv icon

Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Aug 03, 2023
Shaoshi Ling, Yuxuan Hu, Shuangbei Qian, Guoli Ye, Yao Qian, Yifan Gong, Ed Lin, Michael Zeng

Figure 1 for Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Figure 2 for Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Figure 3 for Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Figure 4 for Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Viaarxiv icon

LocSelect: Target Speaker Localization with an Auditory Selective Hearing Mechanism

Oct 17, 2023
Yu Chen, Xinyuan Qian, Zexu Pan, Kainan Chen, Haizhou Li

Viaarxiv icon

Thesis Distillation: Investigating The Impact of Bias in NLP Models on Hate Speech Detection

Aug 31, 2023
Fatma Elsafoury

Figure 1 for Thesis Distillation: Investigating The Impact of Bias in NLP Models on Hate Speech Detection
Viaarxiv icon

Causality Guided Disentanglement for Cross-Platform Hate Speech Detection

Aug 10, 2023
Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu

Figure 1 for Causality Guided Disentanglement for Cross-Platform Hate Speech Detection
Figure 2 for Causality Guided Disentanglement for Cross-Platform Hate Speech Detection
Figure 3 for Causality Guided Disentanglement for Cross-Platform Hate Speech Detection
Figure 4 for Causality Guided Disentanglement for Cross-Platform Hate Speech Detection
Viaarxiv icon

Study of speaker localization with binaural microphone array incorporating auditory filters and lateral angle estimation

Oct 31, 2023
Yanir Maymon, Israel Nelken, Boaz Rafaely

Figure 1 for Study of speaker localization with binaural microphone array incorporating auditory filters and lateral angle estimation
Figure 2 for Study of speaker localization with binaural microphone array incorporating auditory filters and lateral angle estimation
Figure 3 for Study of speaker localization with binaural microphone array incorporating auditory filters and lateral angle estimation
Figure 4 for Study of speaker localization with binaural microphone array incorporating auditory filters and lateral angle estimation
Viaarxiv icon

Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model

Aug 11, 2023
Fan Zhang, Naye Ji, Fuxing Gao, Siyuan Zhao, Zhaohan Wang, Shunman Li

Figure 1 for Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
Figure 2 for Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
Figure 3 for Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
Figure 4 for Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
Viaarxiv icon

MyVoice: Arabic Speech Resource Collaboration Platform

Jul 23, 2023
Yousseif Elshahawy, Yassine El Kheir, Shammur Absar Chowdhury, Ahmed Ali

Figure 1 for MyVoice: Arabic Speech Resource Collaboration Platform
Figure 2 for MyVoice: Arabic Speech Resource Collaboration Platform
Figure 3 for MyVoice: Arabic Speech Resource Collaboration Platform
Figure 4 for MyVoice: Arabic Speech Resource Collaboration Platform
Viaarxiv icon

Learning Separable Hidden Unit Contributions for Speaker-Adaptive Lip-Reading

Add code
Bookmark button
Alert button
Oct 08, 2023
Songtao Luo, Shuang Yang, Shiguang Shan, Xilin Chen

Figure 1 for Learning Separable Hidden Unit Contributions for Speaker-Adaptive Lip-Reading
Figure 2 for Learning Separable Hidden Unit Contributions for Speaker-Adaptive Lip-Reading
Figure 3 for Learning Separable Hidden Unit Contributions for Speaker-Adaptive Lip-Reading
Figure 4 for Learning Separable Hidden Unit Contributions for Speaker-Adaptive Lip-Reading
Viaarxiv icon

Audio-visual video-to-speech synthesis with synthesized input audio

Add code
Bookmark button
Alert button
Jul 31, 2023
Triantafyllos Kefalas, Yannis Panagakis, Maja Pantic

Figure 1 for Audio-visual video-to-speech synthesis with synthesized input audio
Figure 2 for Audio-visual video-to-speech synthesis with synthesized input audio
Figure 3 for Audio-visual video-to-speech synthesis with synthesized input audio
Figure 4 for Audio-visual video-to-speech synthesis with synthesized input audio
Viaarxiv icon