Alert button

"speech": models, code, and papers
Alert button

Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model

Add code
Bookmark button
Alert button
May 11, 2022
Jean-Marc Valin, Ahmed Mustafa, Christopher Montgomery, Timothy B. Terriberry, Michael Klingbeil, Paris Smaragdis, Arvindh Krishnaswamy

Figure 1 for Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model
Figure 2 for Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model
Figure 3 for Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model
Figure 4 for Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model
Viaarxiv icon

DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement

Jul 12, 2021
Xiaohuai Le, Hongsheng Chen, Kai Chen, Jing Lu

Figure 1 for DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement
Figure 2 for DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement
Figure 3 for DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement
Viaarxiv icon

Speaker disentanglement in video-to-speech conversion

May 20, 2021
Dan Oneata, Adriana Stan, Horia Cucu

Figure 1 for Speaker disentanglement in video-to-speech conversion
Figure 2 for Speaker disentanglement in video-to-speech conversion
Figure 3 for Speaker disentanglement in video-to-speech conversion
Figure 4 for Speaker disentanglement in video-to-speech conversion
Viaarxiv icon

Word Order Does Not Matter For Speech Recognition

Oct 18, 2021
Vineel Pratap, Qiantong Xu, Tatiana Likhomanenko, Gabriel Synnaeve, Ronan Collobert

Figure 1 for Word Order Does Not Matter For Speech Recognition
Figure 2 for Word Order Does Not Matter For Speech Recognition
Figure 3 for Word Order Does Not Matter For Speech Recognition
Figure 4 for Word Order Does Not Matter For Speech Recognition
Viaarxiv icon

A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming

Add code
Bookmark button
Alert button
Oct 08, 2021
Hao Yen, Pin-Jui Ku, Chao-Han Huck Yang, Hu Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Yu Tsao

Figure 1 for A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Figure 2 for A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Figure 3 for A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Figure 4 for A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Viaarxiv icon

Neural Speech Synthesis for Estonian

Add code
Bookmark button
Alert button
Oct 06, 2020
Liisa Rätsep, Liisi Piits, Hille Pajupuu, Indrek Hein, Mark Fišel

Viaarxiv icon

Sign-to-Speech Model for Sign Language Understanding: A Case Study of Nigerian Sign Language

Add code
Bookmark button
Alert button
Nov 02, 2021
Steven Kolawole, Opeyemi Osakuade, Nayan Saxena, Babatunde Kazeem Olorisade

Figure 1 for Sign-to-Speech Model for Sign Language Understanding: A Case Study of Nigerian Sign Language
Figure 2 for Sign-to-Speech Model for Sign Language Understanding: A Case Study of Nigerian Sign Language
Figure 3 for Sign-to-Speech Model for Sign Language Understanding: A Case Study of Nigerian Sign Language
Figure 4 for Sign-to-Speech Model for Sign Language Understanding: A Case Study of Nigerian Sign Language
Viaarxiv icon

Cross-speaker style transfer for text-to-speech using data augmentation

Feb 10, 2022
Manuel Sam Ribeiro, Julian Roth, Giulia Comini, Goeric Huybrechts, Adam Gabrys, Jaime Lorenzo-Trueba

Figure 1 for Cross-speaker style transfer for text-to-speech using data augmentation
Figure 2 for Cross-speaker style transfer for text-to-speech using data augmentation
Figure 3 for Cross-speaker style transfer for text-to-speech using data augmentation
Figure 4 for Cross-speaker style transfer for text-to-speech using data augmentation
Viaarxiv icon

Echo State Speech Recognition

Feb 18, 2021
Harsh Shrivastava, Ankush Garg, Yuan Cao, Yu Zhang, Tara Sainath

Figure 1 for Echo State Speech Recognition
Figure 2 for Echo State Speech Recognition
Figure 3 for Echo State Speech Recognition
Viaarxiv icon

FRA-RIR: Fast Random Approximation of the Image-source Method

Add code
Bookmark button
Alert button
Aug 08, 2022
Yi Luo, Jianwei Yu

Figure 1 for FRA-RIR: Fast Random Approximation of the Image-source Method
Figure 2 for FRA-RIR: Fast Random Approximation of the Image-source Method
Viaarxiv icon