Alert button

"speech recognition": models, code, and papers
Alert button

A meta learning scheme for fast accent domain expansion in Mandarin speech recognition

Jul 23, 2023
Ziwei Zhu, Changhao Shan, Bihong Zhang, Jian Yu

Figure 1 for A meta learning scheme for fast accent domain expansion in Mandarin speech recognition
Figure 2 for A meta learning scheme for fast accent domain expansion in Mandarin speech recognition
Figure 3 for A meta learning scheme for fast accent domain expansion in Mandarin speech recognition
Figure 4 for A meta learning scheme for fast accent domain expansion in Mandarin speech recognition
Viaarxiv icon

MixRep: Hidden Representation Mixup for Low-Resource Speech Recognition

Add code
Bookmark button
Alert button
Oct 27, 2023
Jiamin Xie, John H. L. Hansen

Viaarxiv icon

End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation

Add code
Bookmark button
Alert button
Nov 01, 2023
Juan Zuluaga-Gomez, Zhaocheng Huang, Xing Niu, Rohit Paturi, Sundararajan Srinivasan, Prashant Mathur, Brian Thompson, Marcello Federico

Viaarxiv icon

Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Jul 17, 2023
Shaoshi Ling, Yuxuan Hu, Shuangbei Qian, Guoli Ye, Yao Qian, Yifan Gong, Ed Lin, Michael Zeng

Figure 1 for Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Figure 2 for Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Figure 3 for Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Figure 4 for Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Viaarxiv icon

Differential Evolution Algorithm based Hyper-Parameters Selection of Convolutional Neural Network for Speech Command Recognition

Add code
Bookmark button
Alert button
Oct 13, 2023
Sandipan Dhar, Anuvab Sen, Aritra Bandyopadhyay, Nanda Dulal Jana, Arjun Ghosh, Zahra Sarayloo

Viaarxiv icon

The timing bottleneck: Why timing and overlap are mission-critical for conversational user interfaces, speech recognition and dialogue systems

Add code
Bookmark button
Alert button
Jul 28, 2023
Andreas Liesenfeld, Alianda Lopez, Mark Dingemanse

Figure 1 for The timing bottleneck: Why timing and overlap are mission-critical for conversational user interfaces, speech recognition and dialogue systems
Figure 2 for The timing bottleneck: Why timing and overlap are mission-critical for conversational user interfaces, speech recognition and dialogue systems
Figure 3 for The timing bottleneck: Why timing and overlap are mission-critical for conversational user interfaces, speech recognition and dialogue systems
Figure 4 for The timing bottleneck: Why timing and overlap are mission-critical for conversational user interfaces, speech recognition and dialogue systems
Viaarxiv icon

Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment

Add code
Bookmark button
Alert button
Jul 06, 2023
Aref Farhadipour, Hadi Veisi

Figure 1 for Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment
Figure 2 for Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment
Figure 3 for Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment
Figure 4 for Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment
Viaarxiv icon

Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement

Sep 03, 2023
Yu-Wen Chen, Julia Hirschberg, Yu Tsao

Figure 1 for Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement
Figure 2 for Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement
Figure 3 for Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement
Figure 4 for Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement
Viaarxiv icon

Globally Normalising the Transducer for Streaming Speech Recognition

Jul 20, 2023
Rogier van Dalen

Figure 1 for Globally Normalising the Transducer for Streaming Speech Recognition
Figure 2 for Globally Normalising the Transducer for Streaming Speech Recognition
Figure 3 for Globally Normalising the Transducer for Streaming Speech Recognition
Figure 4 for Globally Normalising the Transducer for Streaming Speech Recognition
Viaarxiv icon

Investigating the Emergent Audio Classification Ability of ASR Foundation Models

Nov 15, 2023
Rao Ma, Adian Liusie, Mark J. F. Gales, Kate M. Knill

Figure 1 for Investigating the Emergent Audio Classification Ability of ASR Foundation Models
Figure 2 for Investigating the Emergent Audio Classification Ability of ASR Foundation Models
Figure 3 for Investigating the Emergent Audio Classification Ability of ASR Foundation Models
Figure 4 for Investigating the Emergent Audio Classification Ability of ASR Foundation Models
Viaarxiv icon