Alert button

"speech": models, code, and papers
Alert button

Generative linguistic representation for spoken language identification

Dec 18, 2023
Peng Shen, Xuguang Lu, Hisashi Kawai

Viaarxiv icon

Dialect Transfer for Swiss German Speech Translation

Add code
Bookmark button
Alert button
Oct 13, 2023
Claudio Paonessa, Yanick Schraner, Jan Deriu, Manuela Hürlimann, Manfred Vogel, Mark Cieliebak

Figure 1 for Dialect Transfer for Swiss German Speech Translation
Figure 2 for Dialect Transfer for Swiss German Speech Translation
Figure 3 for Dialect Transfer for Swiss German Speech Translation
Figure 4 for Dialect Transfer for Swiss German Speech Translation
Viaarxiv icon

Unified speech and gesture synthesis using flow matching

Add code
Bookmark button
Alert button
Oct 08, 2023
Shivam Mehta, Ruibo Tu, Simon Alexanderson, Jonas Beskow, Éva Székely, Gustav Eje Henter

Viaarxiv icon

Creating Spoken Dialog Systems in Ultra-Low Resourced Settings

Dec 11, 2023
Moayad Elamin, Muhammad Omer, Yonas Chanie, Henslaac Ndlovu

Viaarxiv icon

FlowMur: A Stealthy and Practical Audio Backdoor Attack with Limited Knowledge

Dec 15, 2023
Jiahe Lan, Jie Wang, Baochen Yan, Zheng Yan, Elisa Bertino

Viaarxiv icon

Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition

Add code
Bookmark button
Alert button
Dec 22, 2023
Qianrui Zhou, Hua Xu, Hao Li, Hanlei Zhang, Xiaohan Zhang, Yifan Wang, Kai Gao

Viaarxiv icon

FusDom: Combining In-Domain and Out-of-Domain Knowledge for Continuous Self-Supervised Learning

Add code
Bookmark button
Alert button
Dec 20, 2023
Ashish Seth, Sreyan Ghosh, S. Umesh, Dinesh Manocha

Viaarxiv icon

Ultra Low Complexity Deep Learning Based Noise Suppression

Dec 13, 2023
Shrishti Saha Shetu, Soumitro Chakrabarty, Oliver Thiergart, Edwin Mabande

Viaarxiv icon

Improved Long-Form Speech Recognition by Jointly Modeling the Primary and Non-primary Speakers

Dec 18, 2023
Guru Prakash Arumugam, Shuo-yiin Chang, Tara N. Sainath, Rohit Prabhavalkar, Quan Wang, Shaan Bijwadia

Viaarxiv icon

GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance

Add code
Bookmark button
Alert button
Dec 12, 2023
Haiming Zhang, Zhihao Yuan, Chaoda Zheng, Xu Yan, Baoyuan Wang, Guanbin Li, Song Wu, Shuguang Cui, Zhen Li

Figure 1 for GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance
Figure 2 for GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance
Figure 3 for GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance
Figure 4 for GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance
Viaarxiv icon