"speech": models, code, and papers

IMS-Speech: A Speech to Text Tool

Aug 13, 2019
Pavel Denisov, Ngoc Thang Vu

LRSpeech: Extremely Low-Resource Speech Synthesis and Recognition

Aug 09, 2020
Jin Xu, Xu Tan, Yi Ren, Tao Qin, Jian Li, Sheng Zhao, Tie-Yan Liu

End-to-End and Self-Supervised Learning for ComParE 2022 Stuttering Sub-Challenge

Jul 20, 2022
Shakeel Ahmad Sheikh, Md Sahidullah, Fabrice Hirsch, Slim Ouni

Speech-VGG: A deep feature extractor for speech processing

Oct 22, 2019
Pierre Beckmann, Mikolaj Kegler, Hugues Saltini, Milos Cernak

iCub Being Social: Exploiting Social Cues for Interactive Object Detection Learning

Jul 27, 2022
Maria Lombardi, Elisa Maiettini, Vadim Tikhanoff, Lorenzo Natale

Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis

May 17, 2021
Erica Cooper, Xin Wang, Junichi Yamagishi

WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis

Jun 20, 2022
Yi Wang, Yi Si

SE-MelGAN -- Speaker Agnostic Rapid Speech Enhancement

Jun 13, 2020
Luka Chkhetiani, Levan Bejanidze

Compact Graph Architecture for Speech Emotion Recognition

Aug 06, 2020
A. Shirian, T. Guha

Translate Reverberated Speech to Anechoic Ones: Speech Dereverberation with BERT

Jul 16, 2020
Yang Jiao
