"speech": models, code, and papers

An Initialization Scheme for Meeting Separation with Spatial Mixture Models

Apr 04, 2022
Christoph Boeddeker, Tobias Cord-Landwehr, Thilo von Neumann, Reinhold Haeb-Umbach

Via arXiv

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Feb 08, 2021
Renqian Luo, Xu Tan, Rui Wang, Tao Qin, Jinzhu Li, Sheng Zhao, Enhong Chen, Tie-Yan Liu

Via arXiv

Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognition

Sep 08, 2021
Maxime Burchi, Valentin Vielzeuf

Via arXiv

Punctuation Restoration in Spanish Customer Support Transcripts using Transfer Learning

May 27, 2022
Xiliang Zhu, Shayna Gardiner, David Rossouw, Tere Roldán, Simon Corston-Oliver

Via arXiv

Improving the Training Recipe for a Robust Conformer-based Hybrid Model

Jun 26, 2022
Mohammad Zeineldeen, Jingjing Xu, Christoph Lüscher, Ralf Schlüter, Hermann Ney

Via arXiv

Contrastive Unsupervised Learning for Speech Emotion Recognition

Feb 12, 2021
Mao Li, Bo Yang, Joshua Levy, Andreas Stolcke, Viktor Rozgic, Spyros Matsoukas, Constantinos Papayiannis, Daniel Bone, Chao Wang

Via arXiv

Self-Attention Generative Adversarial Network for Speech Enhancement

Oct 18, 2020
Huy Phan, Huy Le Nguyen, Oliver Y. Chén, Philipp Koch, Ngoc Q. K. Duong, Ian McLoughlin, Alfred Mertins

Via arXiv

Non-linear frequency warping using constant-Q transformation for speech emotion recognition

Feb 08, 2021
Premjeet Singh, Goutam Saha, Md Sahidullah

Via arXiv

Cross-Modal ASR Post-Processing System for Error Correction and Utterance Rejection

Jan 10, 2022
Jing Du, Shiliang Pu, Qinbo Dong, Chao Jin, Xin Qi, Dian Gu, Ru Wu, Hongwei Zhou

Via arXiv

Blind Estimation of Room Acoustic Parameters and Speech Transmission Index using MTF-based CNNs

Mar 14, 2021
Suradej Duangpummet, Jessada Karnjana, Waree Kongprawechnon, Masashi Unoki

Via arXiv