"speech": models, code, and papers

Alternative Speech: Complementary Method to Counter-Narrative for Better Discourse

Jan 26, 2024
Seungyoon Lee, Dahyun Jung, Chanjun Park, Seolhwa Lee, Heuiseok Lim

Via arXiv

SpeechGPT-Gen: Scaling Chain-of-Information Speech Generation

Jan 25, 2024
Dong Zhang, Xin Zhang, Jun Zhan, Shimin Li, Yaqian Zhou, Xipeng Qiu

Via arXiv

Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition

Feb 08, 2024
Lei Liu, Li Liu, Haizhou Li

Via arXiv

Self-supervised speech representation and contextual text embedding for match-mismatch classification with EEG recording

Feb 01, 2024
Bo Wang, Xiran Xu, Zechen Zhang, Haolin Zhu, YuJie Yan, Xihong Wu, Jing Chen

Via arXiv

Natural language guidance of high-fidelity text-to-speech with synthetic annotations

Feb 02, 2024
Dan Lyth, Simon King

Via arXiv

A Comprehensive Study of the Current State-of-the-Art in Nepali Automatic Speech Recognition Systems

Feb 05, 2024
Rupak Raj Ghimire, Bal Krishna Bal, Prakash Poudyal

Via arXiv

Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge

Feb 02, 2024
Simon Leglaive, Matthieu Fraticelli, Hend ElGhazaly, Léonie Borne, Mostafa Sadeghi, Scott Wisdom, Manuel Pariente, John R. Hershey, Daniel Pressnitzer, Jon P. Barker

Via arXiv

SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR

Mar 04, 2024
Zhiyun Fan, Linhao Dong, Jun Zhang, Lu Lu, Zejun Ma

Via arXiv

Automated Generation of Multiple-Choice Cloze Questions for Assessing English Vocabulary Using GPT-turbo 3.5

Mar 04, 2024
Qiao Wang, Ralph Rose, Naho Orita, Ayaka Sugawara

Via arXiv

KS-Net: Multi-band joint speech restoration and enhancement network for 2024 ICASSP SSI Challenge

Feb 02, 2024
Guochen Yu, Runqiang Han, Chenglin Xu, Haoran Zhao, Nan Li, Chen Zhang, Xiguang Zheng, Chao Zhou, Qi Huang, Bing Yu

Via arXiv