Alert button

"speech recognition": models, code, and papers
Alert button

Seq2seq for Automatic Paraphasia Detection in Aphasic Speech

Dec 16, 2023
Matthew Perez, Duc Le, Amrit Romana, Elise Jones, Keli Licata, Emily Mower Provost

Viaarxiv icon

Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults

Add code
Bookmark button
Alert button
Sep 18, 2023
Ahmed Adel Attia, Jing Liu, Wei Ai, Dorottya Demszky, Carol Espy-Wilson

Figure 1 for Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults
Figure 2 for Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults
Figure 3 for Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults
Figure 4 for Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults
Viaarxiv icon

Key Frame Mechanism For Efficient Conformer Based End-to-end Speech Recognition

Add code
Bookmark button
Alert button
Oct 23, 2023
Peng Fan, Changhao Shan, Jianwei Zhang, Sining Sun, Qing Yang

Viaarxiv icon

Optimizing Convolutional Neural Network Architecture

Dec 17, 2023
Luis Balderas, Miguel Lastra, José M. Benítez

Viaarxiv icon

FastInject: Injecting Unpaired Text Data into CTC-based ASR training

Dec 14, 2023
Keqi Deng, Philip C. Woodland

Viaarxiv icon

Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models

Dec 20, 2023
Atsunori Ogawa, Naohiro Tawara, Marc Delcroix, Shoko Araki

Viaarxiv icon

Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition

Oct 17, 2023
Atsunori Ogawa, Takafumi Moriya, Naoyuki Kamo, Naohiro Tawara, Marc Delcroix

Viaarxiv icon

A Strong Baseline for Temporal Video-Text Alignment

Dec 21, 2023
Zeqian Li, Qirui Chen, Tengda Han, Ya Zhang, Yanfeng Wang, Weidi Xie

Viaarxiv icon

Audio-visual fine-tuning of audio-only ASR models

Dec 14, 2023
Avner May, Dmitriy Serdyuk, Ankit Parag Shah, Otavio Braga, Olivier Siohan

Viaarxiv icon

BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition

Oct 08, 2023
Peikun Chen, Fan Yu, Yuhao Lian, Hongfei Xue, Xucheng Wan, Naijun Zheng, Huan Zhou, Lei Xie

Figure 1 for BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition
Figure 2 for BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition
Figure 3 for BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition
Figure 4 for BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition
Viaarxiv icon