"speech": models, code, and papers

Acoustic correlates of the syllabic rhythm of speech: Modulation spectrum or local features of the temporal envelope

Jan 14, 2023
Yuran Zhang, Jiajie Zou, Nai Ding

Topological Data Analysis for Speech Processing

Dec 02, 2022
Eduard Tulchinskii, Kristian Kuznetsov, Laida Kushnareva, Daniil Cherniavskii, Serguei Barannikov, Irina Piontkovskaya, Sergey Nikolenko, Evgeny Burnaev

Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis

Jun 06, 2023
Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao

Speech Aware Dialog System Technology Challenge (DSTC11)

Dec 16, 2022
Hagen Soltau, Izhak Shafran, Mingqiu Wang, Abhinav Rastogi, Jeffrey Zhao, Ye Jia, Wei Han, Yuan Cao, Aramys Miranda

Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition

Feb 16, 2023
Minsu Kim, Hyung-Il Kim, Yong Man Ro

Evaluation of Virtual Acoustic Environments with Different Acoustic Level of Detail

Jun 29, 2023
Stefan Fichna, Steven van de Par, Stephan D. Ewert

On the relevance of acoustic measurements for creating realistic virtual acoustic environments

Jun 29, 2023
Siegfried Gündert, Stephan D. Ewert, Steven van de Par

ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation

May 29, 2023
Ambuj Mehrish, Abhinav Ramesh Kashyap, Li Yingting, Navonil Majumder, Soujanya Poria

Modality Influence in Multimodal Machine Learning

Jun 10, 2023
Abdelhamid Haouhat, Slimane Bellaouar, Attia Nehar, Hadda Cherroun

Mu$^{2}$SLAM: Multitask, Multilingual Speech and Language Models

Dec 19, 2022
Yong Cheng, Yu Zhang, Melvin Johnson, Wolfgang Macherey, Ankur Bapna
