Alert button

"speech": models, code, and papers
Alert button

Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement

Aug 27, 2021
Yuzi Yan, Wei-Qiang Zhang, Michael T. Johnson

Figure 1 for Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement
Figure 2 for Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement
Figure 3 for Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement
Figure 4 for Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement
Viaarxiv icon

Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features

Dec 10, 2021
Yicheng Hsu, Yonghan Lee, Mingsian R. Bai

Figure 1 for Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Figure 2 for Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Figure 3 for Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Figure 4 for Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Viaarxiv icon

Zero-shot Speech Translation

Jul 13, 2021
Tu Anh Dinh

Figure 1 for Zero-shot Speech Translation
Figure 2 for Zero-shot Speech Translation
Figure 3 for Zero-shot Speech Translation
Figure 4 for Zero-shot Speech Translation
Viaarxiv icon

Deploying self-supervised learning in the wild for hybrid automatic speech recognition

May 17, 2022
Mostafa Karimi, Changliang Liu, Kenichi Kumatani, Yao Qian, Tianyu Wu, Jian Wu

Figure 1 for Deploying self-supervised learning in the wild for hybrid automatic speech recognition
Figure 2 for Deploying self-supervised learning in the wild for hybrid automatic speech recognition
Figure 3 for Deploying self-supervised learning in the wild for hybrid automatic speech recognition
Figure 4 for Deploying self-supervised learning in the wild for hybrid automatic speech recognition
Viaarxiv icon

Speaker Anonymization with Phonetic Intermediate Representations

Jul 11, 2022
Sarina Meyer, Florian Lux, Pavel Denisov, Julia Koch, Pascal Tilli, Ngoc Thang Vu

Figure 1 for Speaker Anonymization with Phonetic Intermediate Representations
Figure 2 for Speaker Anonymization with Phonetic Intermediate Representations
Figure 3 for Speaker Anonymization with Phonetic Intermediate Representations
Figure 4 for Speaker Anonymization with Phonetic Intermediate Representations
Viaarxiv icon

Towards Automatic Speech to Sign Language Generation

Jun 24, 2021
Parul Kapoor, Rudrabha Mukhopadhyay, Sindhu B Hegde, Vinay Namboodiri, C V Jawahar

Figure 1 for Towards Automatic Speech to Sign Language Generation
Figure 2 for Towards Automatic Speech to Sign Language Generation
Figure 3 for Towards Automatic Speech to Sign Language Generation
Figure 4 for Towards Automatic Speech to Sign Language Generation
Viaarxiv icon

Toroidal Probabilistic Spherical Discriminant Analysis

Oct 27, 2022
Anna Silnova, Niko Brümmer, Albert Swart, Lukáš Burget

Figure 1 for Toroidal Probabilistic Spherical Discriminant Analysis
Figure 2 for Toroidal Probabilistic Spherical Discriminant Analysis
Viaarxiv icon

A DNN Based Post-Filter to Enhance the Quality of Coded Speech in MDCT Domain

Jan 28, 2022
Kishan Gupta, Srikanth Korse, Bernd Edler, Guillaume Fuchs

Figure 1 for A DNN Based Post-Filter to Enhance the Quality of Coded Speech in MDCT Domain
Figure 2 for A DNN Based Post-Filter to Enhance the Quality of Coded Speech in MDCT Domain
Figure 3 for A DNN Based Post-Filter to Enhance the Quality of Coded Speech in MDCT Domain
Figure 4 for A DNN Based Post-Filter to Enhance the Quality of Coded Speech in MDCT Domain
Viaarxiv icon

A Conformer Based Acoustic Model for Robust Automatic Speech Recognition

Mar 20, 2022
Yufeng Yang, Peidong Wang, DeLiang Wang

Figure 1 for A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Figure 2 for A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Figure 3 for A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Figure 4 for A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Viaarxiv icon

Speech Prediction using an Adaptive Recurrent Neural Network with Application to Packet Loss Concealment

Nov 15, 2021
Reza Lotfidereshgi, Philippe Gournay

Figure 1 for Speech Prediction using an Adaptive Recurrent Neural Network with Application to Packet Loss Concealment
Figure 2 for Speech Prediction using an Adaptive Recurrent Neural Network with Application to Packet Loss Concealment
Figure 3 for Speech Prediction using an Adaptive Recurrent Neural Network with Application to Packet Loss Concealment
Figure 4 for Speech Prediction using an Adaptive Recurrent Neural Network with Application to Packet Loss Concealment
Viaarxiv icon