Alert button

"speech recognition": models, code, and papers
Alert button

Impact of Dataset on Acoustic Models for Automatic Speech Recognition

Mar 25, 2022
Siddhesh Singh

Figure 1 for Impact of Dataset on Acoustic Models for Automatic Speech Recognition
Figure 2 for Impact of Dataset on Acoustic Models for Automatic Speech Recognition
Figure 3 for Impact of Dataset on Acoustic Models for Automatic Speech Recognition
Viaarxiv icon

Adding Connectionist Temporal Summarization into Conformer to Improve Its Decoder Efficiency For Speech Recognition

Add code
Bookmark button
Alert button
Apr 08, 2022
Nick J. C. Wang, Zongfeng Quan, Shaojun Wang, Jing Xiao

Figure 1 for Adding Connectionist Temporal Summarization into Conformer to Improve Its Decoder Efficiency For Speech Recognition
Figure 2 for Adding Connectionist Temporal Summarization into Conformer to Improve Its Decoder Efficiency For Speech Recognition
Figure 3 for Adding Connectionist Temporal Summarization into Conformer to Improve Its Decoder Efficiency For Speech Recognition
Figure 4 for Adding Connectionist Temporal Summarization into Conformer to Improve Its Decoder Efficiency For Speech Recognition
Viaarxiv icon

Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition

Add code
Bookmark button
Alert button
Jan 26, 2022
Piotr Żelasko, Siyuan Feng, Laureano Moro Velazquez, Ali Abavisani, Saurabhchand Bhati, Odette Scharenborg, Mark Hasegawa-Johnson, Najim Dehak

Figure 1 for Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Figure 2 for Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Figure 3 for Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Figure 4 for Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Viaarxiv icon

Speaker Change Detection for Transformer Transducer ASR

Feb 16, 2023
Jian Wu, Zhuo Chen, Min Hu, Xiong Xiao, Jinyu Li

Figure 1 for Speaker Change Detection for Transformer Transducer ASR
Figure 2 for Speaker Change Detection for Transformer Transducer ASR
Figure 3 for Speaker Change Detection for Transformer Transducer ASR
Figure 4 for Speaker Change Detection for Transformer Transducer ASR
Viaarxiv icon

Robustifying automatic speech recognition by extracting slowly varying features

Dec 14, 2021
Matias Pizarro, Dorothea Kolossa, Asja Fischer

Figure 1 for Robustifying automatic speech recognition by extracting slowly varying features
Figure 2 for Robustifying automatic speech recognition by extracting slowly varying features
Figure 3 for Robustifying automatic speech recognition by extracting slowly varying features
Figure 4 for Robustifying automatic speech recognition by extracting slowly varying features
Viaarxiv icon

MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition

Add code
Bookmark button
Alert button
Feb 25, 2021
Linghui Meng, Jin Xu, Xu Tan, Jindong Wang, Tao Qin, Bo Xu

Figure 1 for MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition
Figure 2 for MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition
Figure 3 for MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition
Viaarxiv icon

Recent Advances in End-to-End Automatic Speech Recognition

Nov 02, 2021
Jinyu Li

Figure 1 for Recent Advances in End-to-End Automatic Speech Recognition
Figure 2 for Recent Advances in End-to-End Automatic Speech Recognition
Figure 3 for Recent Advances in End-to-End Automatic Speech Recognition
Figure 4 for Recent Advances in End-to-End Automatic Speech Recognition
Viaarxiv icon

MSAT: Biologically Inspired Multi-Stage Adaptive Threshold for Conversion of Spiking Neural Networks

Mar 23, 2023
Xiang He, Yang Li, Dongcheng Zhao, Qingqun Kong, Yi Zeng

Figure 1 for MSAT: Biologically Inspired Multi-Stage Adaptive Threshold for Conversion of Spiking Neural Networks
Figure 2 for MSAT: Biologically Inspired Multi-Stage Adaptive Threshold for Conversion of Spiking Neural Networks
Figure 3 for MSAT: Biologically Inspired Multi-Stage Adaptive Threshold for Conversion of Spiking Neural Networks
Figure 4 for MSAT: Biologically Inspired Multi-Stage Adaptive Threshold for Conversion of Spiking Neural Networks
Viaarxiv icon

Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation

Add code
Bookmark button
Alert button
Dec 27, 2022
Tomer Wullach, Shlomo E. Chazan

Figure 1 for Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation
Figure 2 for Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation
Figure 3 for Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation
Figure 4 for Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation
Viaarxiv icon

Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition

Add code
Bookmark button
Alert button
Mar 26, 2022
Xichen Pan, Peiyu Chen, Yichen Gong, Helong Zhou, Xinbing Wang, Zhouhan Lin

Figure 1 for Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition
Figure 2 for Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition
Figure 3 for Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition
Figure 4 for Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition
Viaarxiv icon