Alert button

"speech": models, code, and papers
Alert button

End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study

Add code
Bookmark button
Alert button
Feb 19, 2021
Prashanth Gurunath Shivakumar, Shrikanth Narayanan

Figure 1 for End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study
Figure 2 for End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study
Figure 3 for End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study
Figure 4 for End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study
Viaarxiv icon

Automatic Speaker Independent Dysarthric Speech Intelligibility Assessment System

Mar 10, 2021
Ayush Tripathi, Swapnil Bhosale, Sunil Kumar Kopparapu

Figure 1 for Automatic Speaker Independent Dysarthric Speech Intelligibility Assessment System
Figure 2 for Automatic Speaker Independent Dysarthric Speech Intelligibility Assessment System
Figure 3 for Automatic Speaker Independent Dysarthric Speech Intelligibility Assessment System
Figure 4 for Automatic Speaker Independent Dysarthric Speech Intelligibility Assessment System
Viaarxiv icon

From Speech-to-Speech Translation to Automatic Dubbing

Jan 19, 2020
Marcello Federico, Robert Enyedi, Roberto Barra-Chicote, Ritwik Giri, Umut Isik, Arvindh Krishnaswamy

Figure 1 for From Speech-to-Speech Translation to Automatic Dubbing
Figure 2 for From Speech-to-Speech Translation to Automatic Dubbing
Figure 3 for From Speech-to-Speech Translation to Automatic Dubbing
Figure 4 for From Speech-to-Speech Translation to Automatic Dubbing
Viaarxiv icon

Improving spatial cues for hearables using a parameterized binaural CDR estimator

Jul 17, 2022
Reza Ghanavi, Craig Jin

Figure 1 for Improving spatial cues for hearables using a parameterized binaural CDR estimator
Figure 2 for Improving spatial cues for hearables using a parameterized binaural CDR estimator
Figure 3 for Improving spatial cues for hearables using a parameterized binaural CDR estimator
Figure 4 for Improving spatial cues for hearables using a parameterized binaural CDR estimator
Viaarxiv icon

MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation

Add code
Bookmark button
Alert button
Apr 26, 2021
Xiyun Li, Yong Xu, Meng Yu, Shi-Xiong Zhang, Jiaming Xu, Bo Xu, Dong Yu

Figure 1 for MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation
Figure 2 for MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation
Figure 3 for MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation
Figure 4 for MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation
Viaarxiv icon

Improving Attention-Based Interpretability of Text Classification Transformers

Sep 22, 2022
Nikolaos Mylonas, Ioannis Mollas, Grigorios Tsoumakas

Figure 1 for Improving Attention-Based Interpretability of Text Classification Transformers
Figure 2 for Improving Attention-Based Interpretability of Text Classification Transformers
Figure 3 for Improving Attention-Based Interpretability of Text Classification Transformers
Figure 4 for Improving Attention-Based Interpretability of Text Classification Transformers
Viaarxiv icon

End-to-end Ensemble-based Feature Selection for Paralinguistics Tasks

Add code
Bookmark button
Alert button
Oct 28, 2022
Tamás Grósz, Mittul Singh, Sudarsana Reddy Kadiri, Hemant Kathania, Mikko Kurimo

Figure 1 for End-to-end Ensemble-based Feature Selection for Paralinguistics Tasks
Figure 2 for End-to-end Ensemble-based Feature Selection for Paralinguistics Tasks
Figure 3 for End-to-end Ensemble-based Feature Selection for Paralinguistics Tasks
Figure 4 for End-to-end Ensemble-based Feature Selection for Paralinguistics Tasks
Viaarxiv icon

Continual Learning for Monolingual End-to-End Automatic Speech Recognition

Add code
Bookmark button
Alert button
Dec 17, 2021
Steven Vander Eeckt, Hugo Van hamme

Figure 1 for Continual Learning for Monolingual End-to-End Automatic Speech Recognition
Figure 2 for Continual Learning for Monolingual End-to-End Automatic Speech Recognition
Figure 3 for Continual Learning for Monolingual End-to-End Automatic Speech Recognition
Viaarxiv icon

UniST: Unified End-to-end Model for Streaming and Non-streaming Speech Translation

Add code
Bookmark button
Alert button
Sep 15, 2021
Qianqian Dong, Yaoming Zhu, Mingxuan Wang, Lei Li

Figure 1 for UniST: Unified End-to-end Model for Streaming and Non-streaming Speech Translation
Figure 2 for UniST: Unified End-to-end Model for Streaming and Non-streaming Speech Translation
Figure 3 for UniST: Unified End-to-end Model for Streaming and Non-streaming Speech Translation
Figure 4 for UniST: Unified End-to-end Model for Streaming and Non-streaming Speech Translation
Viaarxiv icon

Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings

Add code
Bookmark button
Alert button
Oct 07, 2021
Oktai Tatanov, Stanislav Beliaev, Boris Ginsburg

Figure 1 for Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings
Figure 2 for Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings
Figure 3 for Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings
Figure 4 for Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings
Viaarxiv icon