"speech": models, code, and papers

Leveraging End-to-End Speech Recognition with Neural Architecture Search

Dec 11, 2019
Ahmed Baruwa, Mojeed Abisiga, Ibrahim Gbadegesin, Afeez Fakunle

Via arXiv

Lip-Reading Driven Deep Learning Approach for Speech Enhancement

Jul 31, 2018
Ahsan Adeel, Mandar Gogate, Amir Hussain, William M. Whitmer

Via arXiv

Does Audio Deepfake Detection Generalize?

Mar 31, 2022
Nicolas M. Müller, Pavel Czempin, Franziska Dieckmann, Adam Froghyar, Konstantin Böttinger

Via arXiv

Class-Conditional Defense GAN Against End-to-End Speech Attacks

Oct 22, 2020
Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

Via arXiv

Benchmarking and challenges in security and privacy for voice biometrics

Sep 01, 2021
Jean-Francois Bonastre, Hector Delgado, Nicholas Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Paul-Gauthier Noe, Jose Patino, Md Sahidullah, Brij Mohan Lal Srivastava, Massimiliano Todisco, Natalia Tomashenko, Emmanuel Vincent, Xin Wang, Junichi Yamagishi

Via arXiv

Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling

Mar 13, 2019
Peidong Wang, Ke Tan, DeLiang Wang

Via arXiv

NN3A: Neural Network supported Acoustic Echo Cancellation, Noise Suppression and Automatic Gain Control for Real-Time Communications

Oct 16, 2021
Ziteng Wang, Yueyue Na, Biao Tian, Qiang Fu

Via arXiv

Nonnegative HMM for Babble Noise Derived from Speech HMM: Application to Speech Enhancement

Sep 16, 2017
Nasser Mohammadiha, Arne Leijon

Via arXiv

Pre-training in Deep Reinforcement Learning for Automatic Speech Recognition

Oct 26, 2019
Thejan Rajapakshe, Rajib Rana, Siddique Latif, Sara Khalifa, Björn W. Schuller

Via arXiv

Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining

Dec 14, 2019
Wen-Chin Huang, Tomoki Hayashi, Yi-Chiao Wu, Hirokazu Kameoka, Tomoki Toda

Via arXiv