"speech": models, code, and papers

R-BI: Regularized Batched Inputs enhance Incremental Decoding Framework for Low-Latency Simultaneous Speech Translation

Jan 11, 2024
Jiaxin Guo, Zhanglin Wu, Zongyao Li, Hengchao Shang, Daimeng Wei, Xiaoyu Chen, Zhiqiang Rao, Shaojun Li, Hao Yang

Via arXiv

Decoupled Spatial and Temporal Processing for Resource Efficient Multichannel Speech Enhancement

Jan 15, 2024
Ashutosh Pandey, Buye Xu

Via arXiv

The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023

Jan 07, 2024
He Wang, Pengcheng Guo, Wei Chen, Pan Zhou, Lei Xie

Via arXiv

TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation

Dec 23, 2023
Xize Cheng, Rongjie Huang, Linjun Li, Tao Jin, Zehan Wang, Aoxiong Yin, Minglei Li, Xinyu Duan, Changpeng Yang, Zhou Zhao

Via arXiv

Style Modeling for Multi-Speaker Articulation-to-Speech

Dec 21, 2023
Miseul Kim, Zhenyu Piao, Jihyun Lee, Hong-Goo Kang

Via arXiv

MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition

Jan 07, 2024
He Wang, Pengcheng Guo, Pan Zhou, Lei Xie

Via arXiv

Proactive Detection of Voice Cloning with Localized Watermarking

Jan 30, 2024
Robin San Roman, Pierre Fernandez, Alexandre Défossez, Teddy Furon, Tuan Tran, Hady Elsahar

Via arXiv

Giving Robots a Voice: Human-in-the-Loop Voice Creation and open-ended Labeling

Feb 07, 2024
Pol van Rijn, Silvan Mertes, Kathrin Janowski, Katharina Weitz, Nori Jacoby, Elisabeth André

Via arXiv

HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting

Feb 09, 2024
Zhenglin Zhou, Fan Ma, Hehe Fan, Yi Yang

Via arXiv

Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition

Jan 19, 2024
Yu Yu, Chao-Han Huck Yang, Tuan Dinh, Sungho Ryu, Jari Kolehmainen, Roger Ren, Denis Filimonov, Prashanth G. Shivakumar, Ankur Gandhe, Ariya Rastow, Jia Xu, Ivan Bulyko, Andreas Stolcke

Via arXiv