Alert button

"speech": models, code, and papers
Alert button

Aligning Speech to Languages to Enhance Code-switching Speech Recognition

Mar 09, 2024
Hexin Liu, Xiangyu Zhang, Leibny Paola Garcia, Andy W. H. Khong, Eng Siong Chng, Shinji Watanabe

Figure 1 for Aligning Speech to Languages to Enhance Code-switching Speech Recognition
Figure 2 for Aligning Speech to Languages to Enhance Code-switching Speech Recognition
Figure 3 for Aligning Speech to Languages to Enhance Code-switching Speech Recognition
Figure 4 for Aligning Speech to Languages to Enhance Code-switching Speech Recognition
Viaarxiv icon

NaturalTurn: A Method to Segment Transcripts into Naturalistic Conversational Turns

Apr 01, 2024
Gus Cooney, Andrew Reece

Viaarxiv icon

Non-verbal information in spontaneous speech -- towards a new framework of analysis

Mar 13, 2024
Tirza Biron, Moshe Barboy, Eran Ben-Artzy, Alona Golubchik, Yanir Marmor, Smadar Szekely, Yaron Winter, David Harel

Figure 1 for Non-verbal information in spontaneous speech -- towards a new framework of analysis
Figure 2 for Non-verbal information in spontaneous speech -- towards a new framework of analysis
Figure 3 for Non-verbal information in spontaneous speech -- towards a new framework of analysis
Figure 4 for Non-verbal information in spontaneous speech -- towards a new framework of analysis
Viaarxiv icon

A Comparative Analysis of Poetry Reading Audio: Singing, Narrating, or Somewhere In Between?

Mar 31, 2024
Kahyun Choi, Minje Kim

Viaarxiv icon

SuperME: Supervised and Mixture-to-Mixture Co-Learning for Speech Enhancement and Robust ASR

Add code
Bookmark button
Alert button
Mar 15, 2024
Zhong-Qiu Wang

Figure 1 for SuperME: Supervised and Mixture-to-Mixture Co-Learning for Speech Enhancement and Robust ASR
Figure 2 for SuperME: Supervised and Mixture-to-Mixture Co-Learning for Speech Enhancement and Robust ASR
Figure 3 for SuperME: Supervised and Mixture-to-Mixture Co-Learning for Speech Enhancement and Robust ASR
Figure 4 for SuperME: Supervised and Mixture-to-Mixture Co-Learning for Speech Enhancement and Robust ASR
Viaarxiv icon

Decentralised Moderation for Interoperable Social Networks: A Conversation-based Approach for Pleroma and the Fediverse

Apr 03, 2024
Vibhor Agarwal, Aravindh Raman, Nishanth Sastry, Ahmed M. Abdelmoniem, Gareth Tyson, Ignacio Castro

Viaarxiv icon

Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers

Add code
Bookmark button
Alert button
Mar 12, 2024
Changsheng Quan, Xiaofei Li

Figure 1 for Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers
Figure 2 for Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers
Figure 3 for Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers
Viaarxiv icon

Phonetic Segmentation of the UCLA Phonetics Lab Archive

Add code
Bookmark button
Alert button
Mar 28, 2024
Eleanor Chodroff, Blaž Pažon, Annie Baker, Steven Moran

Figure 1 for Phonetic Segmentation of the UCLA Phonetics Lab Archive
Figure 2 for Phonetic Segmentation of the UCLA Phonetics Lab Archive
Figure 3 for Phonetic Segmentation of the UCLA Phonetics Lab Archive
Figure 4 for Phonetic Segmentation of the UCLA Phonetics Lab Archive
Viaarxiv icon

Speech Robust Bench: A Robustness Benchmark For Speech Recognition

Add code
Bookmark button
Alert button
Mar 08, 2024
Muhammad A. Shah, David Solans Noguero, Mikko A. Heikkila, Nicolas Kourtellis

Figure 1 for Speech Robust Bench: A Robustness Benchmark For Speech Recognition
Figure 2 for Speech Robust Bench: A Robustness Benchmark For Speech Recognition
Figure 3 for Speech Robust Bench: A Robustness Benchmark For Speech Recognition
Figure 4 for Speech Robust Bench: A Robustness Benchmark For Speech Recognition
Viaarxiv icon

Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator

Mar 25, 2024
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka

Viaarxiv icon