Alert button

"speech recognition": models, code, and papers
Alert button

Multi-stage Large Language Model Correction for Speech Recognition

Oct 17, 2023
Jie Pu, Thai-Son Nguyen, Sebastian Stüker

Viaarxiv icon

ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks

Jan 29, 2024
Bolei Ma, Ercong Nie, Shuzhou Yuan, Helmut Schmid, Michael Färber, Frauke Kreuter, Hinrich Schütze

Viaarxiv icon

Generative Context-aware Fine-tuning of Self-supervised Speech Models

Dec 15, 2023
Suwon Shon, Kwangyoun Kim, Prashant Sridhar, Yi-Te Hsu, Shinji Watanabe, Karen Livescu

Viaarxiv icon

Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks

Add code
Bookmark button
Alert button
Sep 14, 2023
Soumi Maiti, Yifan Peng, Shukjae Choi, Jee-weon Jung, Xuankai Chang, Shinji Watanabe

Figure 1 for Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks
Figure 2 for Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks
Figure 3 for Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks
Figure 4 for Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks
Viaarxiv icon

Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition

Nov 17, 2023
Qijie Shao, Pengcheng Guo, Jinghao Yan, Pengfei Hu, Lei Xie

Viaarxiv icon

Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter

Add code
Bookmark button
Alert button
Sep 19, 2023
Song Li, Yongbin You, Xuezhi Wang, Ke Ding, Guanglu Wan

Figure 1 for Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter
Figure 2 for Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter
Figure 3 for Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter
Figure 4 for Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter
Viaarxiv icon

Conversational Speech Recognition by Learning Audio-textual Cross-modal Contextual Representation

Add code
Bookmark button
Alert button
Oct 22, 2023
Kun Wei, Bei Li, Hang Lv, Quan Lu, Ning Jiang, Lei Xie

Viaarxiv icon

Towards Domain-Specific Cross-Corpus Speech Emotion Recognition Approach

Dec 11, 2023
Yan Zhao, Yuan Zong, Hailun Lian, Cheng Lu, Jingang Shi, Wenming Zheng

Viaarxiv icon

Discriminative Speech Recognition Rescoring with Pre-trained Language Models

Oct 10, 2023
Prashanth Gurunath Shivakumar, Jari Kolehmainen, Yile Gu, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko

Figure 1 for Discriminative Speech Recognition Rescoring with Pre-trained Language Models
Figure 2 for Discriminative Speech Recognition Rescoring with Pre-trained Language Models
Viaarxiv icon

TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch

Add code
Bookmark button
Alert button
Oct 27, 2023
Jeff Hwang, Moto Hira, Caroline Chen, Xiaohui Zhang, Zhaoheng Ni, Guangzhi Sun, Pingchuan Ma, Ruizhe Huang, Vineel Pratap, Yuekai Zhang, Anurag Kumar, Chin-Yun Yu, Chuang Zhu, Chunxi Liu, Jacob Kahn, Mirco Ravanelli, Peng Sun, Shinji Watanabe, Yangyang Shi, Yumeng Tao, Robin Scheibler, Samuele Cornell, Sean Kim, Stavros Petridis

Figure 1 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 2 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 3 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 4 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Viaarxiv icon