Alert button

"speech recognition": models, code, and papers
Alert button

Finnish Parliament ASR corpus - Analysis, benchmarks and statistics

Add code
Bookmark button
Alert button
Mar 28, 2022
Anja Virkkunen, Aku Rouhe, Nhan Phan, Mikko Kurimo

Figure 1 for Finnish Parliament ASR corpus - Analysis, benchmarks and statistics
Figure 2 for Finnish Parliament ASR corpus - Analysis, benchmarks and statistics
Figure 3 for Finnish Parliament ASR corpus - Analysis, benchmarks and statistics
Figure 4 for Finnish Parliament ASR corpus - Analysis, benchmarks and statistics
Viaarxiv icon

The Volcspeech system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge

Feb 10, 2022
Chen Shen, Yi Liu, Wenzhi Fan, Bin Wang, Shixue Wen, Yao Tian, Jun Zhang, Jingsheng Yang, Zejun Ma

Figure 1 for The Volcspeech system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge
Figure 2 for The Volcspeech system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge
Figure 3 for The Volcspeech system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge
Figure 4 for The Volcspeech system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge
Viaarxiv icon

Distillation-Resistant Watermarking for Model Protection in NLP

Add code
Bookmark button
Alert button
Oct 07, 2022
Xuandong Zhao, Lei Li, Yu-Xiang Wang

Figure 1 for Distillation-Resistant Watermarking for Model Protection in NLP
Figure 2 for Distillation-Resistant Watermarking for Model Protection in NLP
Figure 3 for Distillation-Resistant Watermarking for Model Protection in NLP
Figure 4 for Distillation-Resistant Watermarking for Model Protection in NLP
Viaarxiv icon

Position-Invariant Truecasing with a Word-and-Character Hierarchical Recurrent Neural Network

Add code
Bookmark button
Alert button
Sep 01, 2021
Hao Zhang, You-Chi Cheng, Shankar Kumar, Mingqing Chen, Rajiv Mathews

Figure 1 for Position-Invariant Truecasing with a Word-and-Character Hierarchical Recurrent Neural Network
Figure 2 for Position-Invariant Truecasing with a Word-and-Character Hierarchical Recurrent Neural Network
Figure 3 for Position-Invariant Truecasing with a Word-and-Character Hierarchical Recurrent Neural Network
Figure 4 for Position-Invariant Truecasing with a Word-and-Character Hierarchical Recurrent Neural Network
Viaarxiv icon

Enhancing ASR for Stuttered Speech with Limited Data Using Detect and Pass

Feb 08, 2022
Olabanji Shonibare, Xiaosu Tong, Venkatesh Ravichandran

Viaarxiv icon

Towards Structured Deep Neural Network for Automatic Speech Recognition

Nov 08, 2015
Yi-Hsiu Liao, Hung-yi Lee, Lin-shan Lee

Figure 1 for Towards Structured Deep Neural Network for Automatic Speech Recognition
Figure 2 for Towards Structured Deep Neural Network for Automatic Speech Recognition
Figure 3 for Towards Structured Deep Neural Network for Automatic Speech Recognition
Figure 4 for Towards Structured Deep Neural Network for Automatic Speech Recognition
Viaarxiv icon

Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0

Oct 07, 2021
Sameer Khurana, Antoine Laurent, James Glass

Figure 1 for Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0
Figure 2 for Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0
Figure 3 for Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0
Viaarxiv icon

Feature Learning with Gaussian Restricted Boltzmann Machine for Robust Speech Recognition

Sep 23, 2013
Xin Zheng, Zhiyong Wu, Helen Meng, Weifeng Li, Lianhong Cai

Figure 1 for Feature Learning with Gaussian Restricted Boltzmann Machine for Robust Speech Recognition
Figure 2 for Feature Learning with Gaussian Restricted Boltzmann Machine for Robust Speech Recognition
Figure 3 for Feature Learning with Gaussian Restricted Boltzmann Machine for Robust Speech Recognition
Viaarxiv icon

AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario

Add code
Bookmark button
Alert button
Apr 08, 2021
Yihui Fu, Luyao Cheng, Shubo Lv, Yukai Jv, Yuxiang Kong, Zhuo Chen, Yanxin Hu, Lei Xie, Jian Wu, Hui Bu, Xin Xu, Jun Du, Jingdong Chen

Figure 1 for AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario
Figure 2 for AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario
Figure 3 for AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario
Figure 4 for AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario
Viaarxiv icon

A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition

Add code
Bookmark button
Alert button
Jul 27, 2018
Shubham Toshniwal, Anjuli Kannan, Chung-Cheng Chiu, Yonghui Wu, Tara N Sainath, Karen Livescu

Figure 1 for A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition
Figure 2 for A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition
Figure 3 for A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition
Figure 4 for A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition
Viaarxiv icon