"speech": models, code, and papers

A Recurrent Variational Autoencoder for Speech Enhancement

Oct 24, 2019
Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud

DeepFry: Identifying Vocal Fry Using Deep Neural Networks

Mar 31, 2022
Bronya R. Chernyak, Talia Ben Simon, Yael Segal, Jeremy Steffman, Eleanor Chodroff, Jennifer S. Cole, Joseph Keshet

Data Expansion using Back Translation and Paraphrasing for Hate Speech Detection

May 25, 2021
Djamila Romaissa Beddiar, Md Saroar Jahan, Mourad Oussalah

Adjust-free adversarial example generation in speech recognition using evolutionary multi-objective optimization under black-box condition

Dec 22, 2020
Shoma Ishida, Satoshi Ono

Emotion-Controllable Generalized Talking Face Generation

May 02, 2022
Sanjana Sinha, Sandika Biswas, Ravindra Yadav, Brojeshwar Bhowmick

Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition

Feb 14, 2021
Priyabrata Karmakar, Shyh Wei Teng, Guojun Lu

Enhance Language Identification using Dual-mode Model with Knowledge Distillation

Mar 07, 2022
Hexin Liu, Leibny Paola Garcia Perera, Andy W. H. Khong, Justin Dauwels, Suzy J. Styles, Sanjeev Khudanpur

Complex Spectral Mapping With Attention Based Convolution Recurrent Neural Network for Speech Enhancement

Apr 12, 2021
Liming Zhou, Yongyu Gao, Ziluo Wang, Jiwei Li, Wenbin Zhang

FaceFilter: Audio-visual speech separation using still images

May 14, 2020
Soo-Whan Chung, Soyeon Choe, Joon Son Chung, Hong-Goo Kang

Space-Efficient Representation of Entity-centric Query Language Models

Jun 29, 2022
Christophe Van Gysel, Mirko Hannemann, Ernest Pusateri, Youssef Oualil, Ilya Oparin
