Alert button

"speech recognition": models, code, and papers
Alert button

Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers

May 23, 2023
Jan Silovsky, Liuhui Deng, Arturo Argueta, Tresi Arvizo, Roger Hsiao, Sasha Kuznietsov, Yiu-Chang Lin, Xiaoqiang Xiao, Yuanyuan Zhang

Figure 1 for Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers
Figure 2 for Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers
Figure 3 for Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers
Figure 4 for Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers
Viaarxiv icon

Personalized Adaptation with Pre-trained Speech Encoders for Continuous Emotion Recognition

Sep 05, 2023
Minh Tran, Yufeng Yin, Mohammad Soleymani

Figure 1 for Personalized Adaptation with Pre-trained Speech Encoders for Continuous Emotion Recognition
Figure 2 for Personalized Adaptation with Pre-trained Speech Encoders for Continuous Emotion Recognition
Figure 3 for Personalized Adaptation with Pre-trained Speech Encoders for Continuous Emotion Recognition
Figure 4 for Personalized Adaptation with Pre-trained Speech Encoders for Continuous Emotion Recognition
Viaarxiv icon

Agricultural Robotic System: The Automation of Detection and Speech Control

Jul 19, 2023
Yang Wenkai, Ji Ruihang, Yue Yiran, Gu Zhonghan, Shu Wanyang, Sam Ge Shuzhi

Figure 1 for Agricultural Robotic System: The Automation of Detection and Speech Control
Figure 2 for Agricultural Robotic System: The Automation of Detection and Speech Control
Figure 3 for Agricultural Robotic System: The Automation of Detection and Speech Control
Figure 4 for Agricultural Robotic System: The Automation of Detection and Speech Control
Viaarxiv icon

Improving Accented Speech Recognition with Multi-Domain Training

Mar 14, 2023
Lucas Maison, Yannick Estève

Figure 1 for Improving Accented Speech Recognition with Multi-Domain Training
Figure 2 for Improving Accented Speech Recognition with Multi-Domain Training
Figure 3 for Improving Accented Speech Recognition with Multi-Domain Training
Figure 4 for Improving Accented Speech Recognition with Multi-Domain Training
Viaarxiv icon

Using Text Injection to Improve Recognition of Personal Identifiers in Speech

Aug 14, 2023
Yochai Blau, Rohan Agrawal, Lior Madmony, Gary Wang, Andrew Rosenberg, Zhehuai Chen, Zorik Gekhman, Genady Beryozkin, Parisa Haghani, Bhuvana Ramabhadran

Viaarxiv icon

Cross-modal Alignment with Optimal Transport for CTC-based ASR

Sep 24, 2023
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai

Viaarxiv icon

Improving Robustness of Neural Inverse Text Normalization via Data-Augmentation, Semi-Supervised Learning, and Post-Aligning Method

Sep 12, 2023
Juntae Kim, Minkyu Lim, Seokjin Hong

Figure 1 for Improving Robustness of Neural Inverse Text Normalization via Data-Augmentation, Semi-Supervised Learning, and Post-Aligning Method
Figure 2 for Improving Robustness of Neural Inverse Text Normalization via Data-Augmentation, Semi-Supervised Learning, and Post-Aligning Method
Figure 3 for Improving Robustness of Neural Inverse Text Normalization via Data-Augmentation, Semi-Supervised Learning, and Post-Aligning Method
Viaarxiv icon

Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network

Add code
Bookmark button
Alert button
Sep 15, 2023
Yiling Huang, Weiran Wang, Guanlong Zhao, Hank Liao, Wei Xia, Quan Wang

Figure 1 for Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network
Figure 2 for Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network
Figure 3 for Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network
Figure 4 for Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network
Viaarxiv icon

HTEC: Human Transcription Error Correction

Sep 18, 2023
Hanbo Sun, Jian Gao, Xiaomin Wu, Anjie Fang, Cheng Cao, Zheng Du

Figure 1 for HTEC: Human Transcription Error Correction
Figure 2 for HTEC: Human Transcription Error Correction
Figure 3 for HTEC: Human Transcription Error Correction
Figure 4 for HTEC: Human Transcription Error Correction
Viaarxiv icon

Learning Speech Representation From Contrastive Token-Acoustic Pretraining

Add code
Bookmark button
Alert button
Sep 06, 2023
Chunyu Qiang, Hao Li, Yixin Tian, Ruibo Fu, Tao Wang, Longbiao Wang, Jianwu Dang

Figure 1 for Learning Speech Representation From Contrastive Token-Acoustic Pretraining
Figure 2 for Learning Speech Representation From Contrastive Token-Acoustic Pretraining
Viaarxiv icon