Alert button

"speech recognition": models, code, and papers
Alert button

Advancing VAD Systems Based on Multi-Task Learning with Improved Model Structures

Dec 19, 2023
Lingyun Zuo, Keyu An, Shiliang Zhang, Zhijie Yan

Viaarxiv icon

Acoustic Model Fusion for End-to-end Speech Recognition

Oct 10, 2023
Zhihong Lei, Mingbin Xu, Shiyi Han, Leo Liu, Zhen Huang, Tim Ng, Yuanyuan Zhang, Ernest Pusateri, Mirko Hannemann, Yaqiao Deng, Man-Hung Siu

Figure 1 for Acoustic Model Fusion for End-to-end Speech Recognition
Figure 2 for Acoustic Model Fusion for End-to-end Speech Recognition
Figure 3 for Acoustic Model Fusion for End-to-end Speech Recognition
Figure 4 for Acoustic Model Fusion for End-to-end Speech Recognition
Viaarxiv icon

On the Impact of Quantization and Pruning of Self-Supervised Speech Models for Downstream Speech Recognition Tasks "In-the-Wild''

Sep 25, 2023
Arthur Pimentel, Heitor Guimarães, Anderson R. Avila, Mehdi Rezagholizadeh, Tiago H. Falk

Figure 1 for On the Impact of Quantization and Pruning of Self-Supervised Speech Models for Downstream Speech Recognition Tasks "In-the-Wild''
Figure 2 for On the Impact of Quantization and Pruning of Self-Supervised Speech Models for Downstream Speech Recognition Tasks "In-the-Wild''
Figure 3 for On the Impact of Quantization and Pruning of Self-Supervised Speech Models for Downstream Speech Recognition Tasks "In-the-Wild''
Figure 4 for On the Impact of Quantization and Pruning of Self-Supervised Speech Models for Downstream Speech Recognition Tasks "In-the-Wild''
Viaarxiv icon

kNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels

Add code
Bookmark button
Alert button
Dec 21, 2023
Jiaming Zhou, Shiwan Zhao, Yaqi Liu, Wenjia Zeng, Yong Chen, Yong Qin

Viaarxiv icon

Improved Contextual Recognition In Automatic Speech Recognition Systems By Semantic Lattice Rescoring

Oct 17, 2023
Ankitha Sudarshan, Vinay Samuel, Parth Patwa, Ibtihel Amara, Aman Chadha

Figure 1 for Improved Contextual Recognition In Automatic Speech Recognition Systems By Semantic Lattice Rescoring
Figure 2 for Improved Contextual Recognition In Automatic Speech Recognition Systems By Semantic Lattice Rescoring
Figure 3 for Improved Contextual Recognition In Automatic Speech Recognition Systems By Semantic Lattice Rescoring
Figure 4 for Improved Contextual Recognition In Automatic Speech Recognition Systems By Semantic Lattice Rescoring
Viaarxiv icon

Visual Speech Recognition for Low-resource Languages with Automatic Labels From Whisper Model

Add code
Bookmark button
Alert button
Sep 15, 2023
Jeong Hun Yeo, Minsu Kim, Shinji Watanabe, Yong Man Ro

Figure 1 for Visual Speech Recognition for Low-resource Languages with Automatic Labels From Whisper Model
Figure 2 for Visual Speech Recognition for Low-resource Languages with Automatic Labels From Whisper Model
Figure 3 for Visual Speech Recognition for Low-resource Languages with Automatic Labels From Whisper Model
Figure 4 for Visual Speech Recognition for Low-resource Languages with Automatic Labels From Whisper Model
Viaarxiv icon

PhasePerturbation: Speech Data Augmentation via Phase Perturbation for Automatic Speech Recognition

Dec 13, 2023
Chengxi Lei, Satwinder Singh, Feng Hou, Xiaoyun Jia, Ruili Wang

Viaarxiv icon

Enhancing Code-switching Speech Recognition with Interactive Language Biases

Add code
Bookmark button
Alert button
Sep 29, 2023
Hexin Liu, Leibny Paola Garcia, Xiangyu Zhang, Andy W. H. Khong, Sanjeev Khudanpur

Figure 1 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 2 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 3 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 4 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Viaarxiv icon

Towards Probing Contact Center Large Language Models

Dec 26, 2023
Varun Nathan, Ayush Kumar, Digvijay Ingle, Jithendra Vepa

Viaarxiv icon

Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition

Oct 07, 2023
Kaixun Huang, Ao Zhang, Binbin Zhang, Tianyi Xu, Xingchen Song, Lei Xie

Viaarxiv icon