Alert button

"speech": models, code, and papers
Alert button

An integrated framework for developing and evaluating an automated lecture style assessment system

Nov 30, 2023
Eleni Dimitriadou, Andreas Lanitis

Figure 1 for An integrated framework for developing and evaluating an automated lecture style assessment system
Figure 2 for An integrated framework for developing and evaluating an automated lecture style assessment system
Figure 3 for An integrated framework for developing and evaluating an automated lecture style assessment system
Figure 4 for An integrated framework for developing and evaluating an automated lecture style assessment system
Viaarxiv icon

Can Whisper perform speech-based in-context learning

Add code
Bookmark button
Alert button
Sep 13, 2023
Siyin Wang, Chao-Han Huck Yang, Ji Wu, Chao Zhang

Figure 1 for Can Whisper perform speech-based in-context learning
Figure 2 for Can Whisper perform speech-based in-context learning
Figure 3 for Can Whisper perform speech-based in-context learning
Figure 4 for Can Whisper perform speech-based in-context learning
Viaarxiv icon

RAUCG: Retrieval-Augmented Unsupervised Counter Narrative Generation for Hate Speech

Oct 09, 2023
Shuyu Jiang, Wenyi Tang, Xingshu Chen, Rui Tanga, Haizhou Wang, Wenxian Wang

Figure 1 for RAUCG: Retrieval-Augmented Unsupervised Counter Narrative Generation for Hate Speech
Figure 2 for RAUCG: Retrieval-Augmented Unsupervised Counter Narrative Generation for Hate Speech
Figure 3 for RAUCG: Retrieval-Augmented Unsupervised Counter Narrative Generation for Hate Speech
Figure 4 for RAUCG: Retrieval-Augmented Unsupervised Counter Narrative Generation for Hate Speech
Viaarxiv icon

Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition

Add code
Bookmark button
Alert button
Sep 26, 2023
Dongji Gao, Hainan Xu, Desh Raj, Leibny Paola Garcia Perera, Daniel Povey, Sanjeev Khudanpur

Figure 1 for Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Figure 2 for Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Figure 3 for Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Figure 4 for Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Viaarxiv icon

Generative Adversarial Training for Text-to-Speech Synthesis Based on Raw Phonetic Input and Explicit Prosody Modelling

Oct 14, 2023
Tiberiu Boros, Stefan Daniel Dumitrescu, Ionut Mironica, Radu Chivereanu

Viaarxiv icon

FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency

Add code
Bookmark button
Alert button
Sep 22, 2023
Rui Liu, Jiatian Xi, Ziyue Jiang, Haizhou Li

Figure 1 for FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency
Figure 2 for FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency
Figure 3 for FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency
Figure 4 for FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency
Viaarxiv icon

A Multiscale Autoencoder (MSAE) Framework for End-to-End Neural Network Speech Enhancement

Sep 21, 2023
Bengt J. Borgstrom, Michael S. Brandstein

Figure 1 for A Multiscale Autoencoder (MSAE) Framework for End-to-End Neural Network Speech Enhancement
Figure 2 for A Multiscale Autoencoder (MSAE) Framework for End-to-End Neural Network Speech Enhancement
Figure 3 for A Multiscale Autoencoder (MSAE) Framework for End-to-End Neural Network Speech Enhancement
Figure 4 for A Multiscale Autoencoder (MSAE) Framework for End-to-End Neural Network Speech Enhancement
Viaarxiv icon

One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition

Add code
Bookmark button
Alert button
Oct 02, 2023
Samuele Cornell, Jee-weon Jung, Shinji Watanabe, Stefano Squartini

Figure 1 for One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition
Figure 2 for One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition
Figure 3 for One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition
Viaarxiv icon

Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization

Add code
Bookmark button
Alert button
Sep 28, 2023
Thilo von Neumann, Christoph Boeddeker, Tobias Cord-Landwehr, Marc Delcroix, Reinhold Haeb-Umbach

Figure 1 for Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization
Figure 2 for Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization
Figure 3 for Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization
Figure 4 for Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization
Viaarxiv icon

Zipformer: A faster and better encoder for automatic speech recognition

Add code
Bookmark button
Alert button
Oct 17, 2023
Zengwei Yao, Liyong Guo, Xiaoyu Yang, Wei Kang, Fangjun Kuang, Yifan Yang, Zengrui Jin, Long Lin, Daniel Povey

Viaarxiv icon