Alert button

"speech": models, code, and papers
Alert button

Seq2seq for Automatic Paraphasia Detection in Aphasic Speech

Dec 16, 2023
Matthew Perez, Duc Le, Amrit Romana, Elise Jones, Keli Licata, Emily Mower Provost

Viaarxiv icon

BAE-Net: A Low complexity and high fidelity Bandwidth-Adaptive neural network for speech super-resolution

Dec 21, 2023
Guochen Yu, Xiguang Zheng, Nan Li, Runqiang Han, Chengshi Zheng, Chen Zhang, Chao Zhou, Qi Huang, Bing Yu

Viaarxiv icon

An Integration of Pre-Trained Speech and Language Models for End-to-End Speech Recognition

Dec 06, 2023
Yukiya Hono, Koh Mitsuda, Tianyu Zhao, Kentaro Mitsui, Toshiaki Wakatsuki, Kei Sawada

Viaarxiv icon

Resource-constrained stereo singing voice cancellation

Jan 22, 2024
Clara Borrelli, James Rae, Dogac Basaran, Matt McVicar, Mehrez Souden, Matthias Mauch

Viaarxiv icon

Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial Animation

Dec 18, 2023
Hui Fu, Zeqing Wang, Ke Gong, Keze Wang, Tianshui Chen, Haojie Li, Haifeng Zeng, Wenxiong Kang

Viaarxiv icon

DDD: A Perceptually Superior Low-Response-Time DNN-based Declipper

Jan 08, 2024
Jayeon Yi, Junghyun Koo, Kyogu Lee

Viaarxiv icon

Design, construction and evaluation of emotional multimodal pathological speech database

Dec 14, 2023
Ting Zhu, Shufei Duan, Huizhi Liang, Wei Zhang

Figure 1 for Design, construction and evaluation of emotional multimodal pathological speech database
Figure 2 for Design, construction and evaluation of emotional multimodal pathological speech database
Figure 3 for Design, construction and evaluation of emotional multimodal pathological speech database
Figure 4 for Design, construction and evaluation of emotional multimodal pathological speech database
Viaarxiv icon

Enhancement of a Text-Independent Speaker Verification System by using Feature Combination and Parallel-Structure Classifiers

Jan 26, 2024
Kerlos Atia Abdalmalak, Ascensión Gallardo-Antol'in

Viaarxiv icon

ScoreDec: A Phase-preserving High-Fidelity Audio Codec with A Generalized Score-based Diffusion Post-filter

Jan 22, 2024
Yi-Chiao Wu, Dejan Marković, Steven Krenn, Israel D. Gebru, Alexander Richard

Viaarxiv icon

Attention-Guided Adaptation for Code-Switching Speech Recognition

Dec 14, 2023
Bobbi Aditya, Mahdin Rohmatillah, Liang-Hsuan Tai, Jen-Tzung Chien

Figure 1 for Attention-Guided Adaptation for Code-Switching Speech Recognition
Figure 2 for Attention-Guided Adaptation for Code-Switching Speech Recognition
Figure 3 for Attention-Guided Adaptation for Code-Switching Speech Recognition
Figure 4 for Attention-Guided Adaptation for Code-Switching Speech Recognition
Viaarxiv icon