"speech": models, code, and papers

KoSpeech: Open-Source Toolkit for End-to-End Korean Speech Recognition

Sep 26, 2020
Soohwan Kim, Seyoung Bae, Cheolhwang Won

Time-Frequency Localization Using Deep Convolutional Maxout Neural Network in Persian Speech Recognition

Aug 09, 2021
Arash Dehghani, Seyyed Ali Seyyedsalehi

Relative Positional Encoding for Speech Recognition and Direct Translation

May 20, 2020
Ngoc-Quan Pham, Thanh-Le Ha, Tuan-Nam Nguyen, Thai-Son Nguyen, Elizabeth Salesky, Sebastian Stueker, Jan Niehues, Alexander Waibel

A non-causal FFTNet architecture for speech enhancement

Jun 08, 2020
Muhammed PV Shifas, Nagaraj Adiga, Vassilis Tsiaras, Yannis Stylianou

CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution

Jul 04, 2022
Taeho Kim, Yongin Kwon, Jemin Lee, Taeho Kim, Sangtae Ha

On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era

Apr 20, 2021
Shahin Amiriparian, Artem Sokolov, Ilhan Aslan, Lukas Christ, Maurice Gerczuk, Tobias Hübner, Dmitry Lamanov, Manuel Milling, Sandra Ottl, Ilya Poduremennykh, Evgeniy Shuranov, Björn W. Schuller

Refining Automatic Speech Recognition System for older adults

Nov 17, 2020
Liu Chen, Meysam Asgari

Data augmentation for learning predictive models on EEG: a systematic comparison

Jun 29, 2022
Cédric Rommel, Joseph Paillard, Thomas Moreau, Alexandre Gramfort

A comparison of Vietnamese Statistical Parametric Speech Synthesis Systems

May 26, 2020
Huy Kinh Phan, Viet Lam Phung, Tuan Anh Dinh, Bao Quoc Nguyen

Thutmose Tagger: Single-pass neural model for Inverse Text Normalization

Jul 29, 2022
Alexandra Antonova, Evelina Bakhturina, Boris Ginsburg
