Alert button

"speech recognition": models, code, and papers
Alert button

Noise robust distillation of self-supervised speech models via correlation metrics

Dec 19, 2023
Fabian Ritter-Gutierrez, Kuan-Po Huang, Dianwen Ng, Jeremy H. M. Wong, Hung-yi Lee, Eng Siong Chng, Nancy F. Chen

Viaarxiv icon

Advancing VAD Systems Based on Multi-Task Learning with Improved Model Structures

Dec 19, 2023
Lingyun Zuo, Keyu An, Shiliang Zhang, Zhijie Yan

Viaarxiv icon

kNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels

Add code
Bookmark button
Alert button
Dec 21, 2023
Jiaming Zhou, Shiwan Zhao, Yaqi Liu, Wenjia Zeng, Yong Chen, Yong Qin

Viaarxiv icon

Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition

Oct 07, 2023
Kaixun Huang, Ao Zhang, Binbin Zhang, Tianyi Xu, Xingchen Song, Lei Xie

Viaarxiv icon

Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition

Add code
Bookmark button
Alert button
Oct 10, 2023
Srijith Radhakrishnan, Chao-Han Huck Yang, Sumeer Ahmad Khan, Rohit Kumar, Narsis A. Kiani, David Gomez-Cabrero, Jesper N. Tegner

Figure 1 for Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
Figure 2 for Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
Figure 3 for Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
Figure 4 for Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
Viaarxiv icon

PhasePerturbation: Speech Data Augmentation via Phase Perturbation for Automatic Speech Recognition

Dec 13, 2023
Chengxi Lei, Satwinder Singh, Feng Hou, Xiaoyun Jia, Ruili Wang

Viaarxiv icon

Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition

Add code
Bookmark button
Alert button
Sep 26, 2023
Dongji Gao, Hainan Xu, Desh Raj, Leibny Paola Garcia Perera, Daniel Povey, Sanjeev Khudanpur

Figure 1 for Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Figure 2 for Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Figure 3 for Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Figure 4 for Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Viaarxiv icon

Key Frame Mechanism For Efficient Conformer Based End-to-end Speech Recognition

Add code
Bookmark button
Alert button
Oct 28, 2023
Peng Fan, Changhao Shan, Sining Sun, Qing Yang, Jianwei Zhang

Viaarxiv icon

An analysis of large speech models-based representations for speech emotion recognition

Add code
Bookmark button
Alert button
Nov 01, 2023
Adrian Bogdan Stânea, Vlad Striletchi, Cosmin Striletchi, Adriana Stan

Viaarxiv icon

Extending Whisper with prompt tuning to target-speaker ASR

Dec 13, 2023
Hao Ma, Zhiyuan Peng, Mingjie Shao, Jing Li, Ju Liu

Viaarxiv icon