"speech": models, code, and papers

Comparing Subjective Perceptions of Robot-to-Human Handover Trajectories

Nov 16, 2022
Alexander Calvert, Wesley Chan, Tin Tran, Sara Sheikholeslami, Rhys Newbury, Akansel Cosgun, Elizabeth Croft

An Attribute-Aligned Strategy for Learning Speech Representation

Jun 05, 2021
Yu-Lin Huang, Bo-Hao Su, Y.-W. Peter Hong, Chi-Chun Lee

A Streamwise GAN Vocoder for Wideband Speech Coding at Very Low Bit Rate

Aug 09, 2021
Ahmed Mustafa, Jan Büthe, Srikanth Korse, Kishan Gupta, Guillaume Fuchs, Nicola Pia

T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation

May 24, 2022
Paul-Ambroise Duquenne, Hongyu Gong, Benoît Sagot, Holger Schwenk

Sampling Rate Offset Estimation and Compensation for Distributed Adaptive Node-Specific Signal Estimation in Wireless Acoustic Sensor Networks

Nov 04, 2022
Paul Didier, Toon van Waterschoot, Simon Doclo, Marc Moonen

VSEGAN: Visual Speech Enhancement Generative Adversarial Network

Feb 04, 2021
Xinmeng Xu, Yang Wang, Dongxiang Xu, Yiyuan Peng, Cong Zhang, Jie Jia, Binbin Chen

Investigation of Practical Aspects of Single Channel Speech Separation for ASR

Jul 05, 2021
Jian Wu, Zhuo Chen, Sanyuan Chen, Yu Wu, Takuya Yoshioka, Naoyuki Kanda, Shujie Liu, Jinyu Li

A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization

Mar 23, 2022
Fadi Biadsy, Youzheng Chen, Xia Zhang, Oleg Rybakov, Andrew Rosenberg, Pedro J. Moreno

CLSRIL-23: Cross Lingual Speech Representations for Indic Languages

Jul 15, 2021
Anirudh Gupta, Harveen Singh Chadha, Priyanshi Shah, Neeraj Chimmwal, Ankur Dhuriya, Rishabh Gaur, Vivek Raghavan

Speech BERT Embedding For Improving Prosody in Neural TTS

Jun 08, 2021
Liping Chen, Yan Deng, Xi Wang, Frank K. Soong, Lei He
