Alert button

"speech": models, code, and papers
Alert button

Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning

Jun 23, 2023
Zhongzhi Yu, Yang Zhang, Kaizhi Qian, Yonggan Fu, Yingyan Lin

Figure 1 for Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning
Figure 2 for Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning
Figure 3 for Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning
Figure 4 for Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning
Viaarxiv icon

Contrastive Learning-Based Audio to Lyrics Alignment for Multiple Languages

Add code
Bookmark button
Alert button
Jun 13, 2023
Simon Durand, Daniel Stoller, Sebastian Ewert

Figure 1 for Contrastive Learning-Based Audio to Lyrics Alignment for Multiple Languages
Figure 2 for Contrastive Learning-Based Audio to Lyrics Alignment for Multiple Languages
Figure 3 for Contrastive Learning-Based Audio to Lyrics Alignment for Multiple Languages
Figure 4 for Contrastive Learning-Based Audio to Lyrics Alignment for Multiple Languages
Viaarxiv icon

Large-scale Language Model Rescoring on Long-form Data

Jun 13, 2023
Tongzhou Chen, Cyril Allauzen, Yinghui Huang, Daniel Park, David Rybach, W. Ronny Huang, Rodrigo Cabrera, Kartik Audhkhasi, Bhuvana Ramabhadran, Pedro J. Moreno, Michael Riley

Figure 1 for Large-scale Language Model Rescoring on Long-form Data
Figure 2 for Large-scale Language Model Rescoring on Long-form Data
Figure 3 for Large-scale Language Model Rescoring on Long-form Data
Figure 4 for Large-scale Language Model Rescoring on Long-form Data
Viaarxiv icon

TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS Challenge

Mar 14, 2023
Yukai Ju, Jun Chen, Shimin Zhang, Shulin He, Wei Rao, Weixin Zhu, Yannan Wang, Tao Yu, Shidong Shang

Figure 1 for TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS Challenge
Figure 2 for TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS Challenge
Figure 3 for TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS Challenge
Viaarxiv icon

Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Feb 21, 2023
Leyuan Qu, Cornelius Weber, Stefan Wermter

Figure 1 for Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition
Figure 2 for Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition
Figure 3 for Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition
Figure 4 for Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition
Viaarxiv icon

L2 proficiency assessment using self-supervised speech representations

Nov 16, 2022
Stefano Bannò, Kate M. Knill, Marco Matassoni, Vyas Raina, Mark J. F. Gales

Figure 1 for L2 proficiency assessment using self-supervised speech representations
Figure 2 for L2 proficiency assessment using self-supervised speech representations
Figure 3 for L2 proficiency assessment using self-supervised speech representations
Figure 4 for L2 proficiency assessment using self-supervised speech representations
Viaarxiv icon

Synthetic Wave-Geometric Impulse Responses for Improved Speech Dereverberation

Add code
Bookmark button
Alert button
Dec 10, 2022
Rohith Aralikatti, Zhenyu Tang, Dinesh Manocha

Figure 1 for Synthetic Wave-Geometric Impulse Responses for Improved Speech Dereverberation
Figure 2 for Synthetic Wave-Geometric Impulse Responses for Improved Speech Dereverberation
Figure 3 for Synthetic Wave-Geometric Impulse Responses for Improved Speech Dereverberation
Figure 4 for Synthetic Wave-Geometric Impulse Responses for Improved Speech Dereverberation
Viaarxiv icon

ASDF: A Differential Testing Framework for Automatic Speech Recognition Systems

Add code
Bookmark button
Alert button
Feb 11, 2023
Daniel Hao Xian Yuen, Andrew Yong Chen Pang, Zhou Yang, Chun Yong Chong, Mei Kuan Lim, David Lo

Figure 1 for ASDF: A Differential Testing Framework for Automatic Speech Recognition Systems
Viaarxiv icon

Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation

Nov 22, 2022
Vinay Kothapally, John H. L. Hansen

Figure 1 for Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation
Figure 2 for Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation
Figure 3 for Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation
Figure 4 for Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation
Viaarxiv icon

hierarchical network with decoupled knowledge distillation for speech emotion recognition

Mar 09, 2023
Ziping Zhao, Huan Wang, Haishuai Wang, Bjorn Schuller

Figure 1 for hierarchical network with decoupled knowledge distillation for speech emotion recognition
Figure 2 for hierarchical network with decoupled knowledge distillation for speech emotion recognition
Figure 3 for hierarchical network with decoupled knowledge distillation for speech emotion recognition
Figure 4 for hierarchical network with decoupled knowledge distillation for speech emotion recognition
Viaarxiv icon