Alert button

"speech": models, code, and papers
Alert button

Speech & Song Emotion Recognition Using Multilayer Perceptron and Standard Vector Machine

May 19, 2021
Behzad Javaheri

Figure 1 for Speech & Song Emotion Recognition Using Multilayer Perceptron and Standard Vector Machine
Figure 2 for Speech & Song Emotion Recognition Using Multilayer Perceptron and Standard Vector Machine
Figure 3 for Speech & Song Emotion Recognition Using Multilayer Perceptron and Standard Vector Machine
Figure 4 for Speech & Song Emotion Recognition Using Multilayer Perceptron and Standard Vector Machine
Viaarxiv icon

Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution

Oct 07, 2021
Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer

Figure 1 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Figure 2 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Figure 3 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Figure 4 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Viaarxiv icon

Consonant-Vowel Transition Models Based on Deep Learning for Objective Evaluation of Articulation

Add code
Bookmark button
Alert button
Mar 18, 2022
Vikram C. Mathad, Julie M. Liss, Kathy Chapman, Nancy Scherer, Visar Berisha

Figure 1 for Consonant-Vowel Transition Models Based on Deep Learning for Objective Evaluation of Articulation
Figure 2 for Consonant-Vowel Transition Models Based on Deep Learning for Objective Evaluation of Articulation
Figure 3 for Consonant-Vowel Transition Models Based on Deep Learning for Objective Evaluation of Articulation
Figure 4 for Consonant-Vowel Transition Models Based on Deep Learning for Objective Evaluation of Articulation
Viaarxiv icon

A review of on-device fully neural end-to-end automatic speech recognition algorithms

Dec 14, 2020
Chanwoo Kim, Dhananjaya Gowda, Dongsoo Lee, Jiyeon Kim, Ankur Kumar, Sungsoo Kim, Abhinav Garg, Changwoo Han

Figure 1 for A review of on-device fully neural end-to-end automatic speech recognition algorithms
Figure 2 for A review of on-device fully neural end-to-end automatic speech recognition algorithms
Figure 3 for A review of on-device fully neural end-to-end automatic speech recognition algorithms
Figure 4 for A review of on-device fully neural end-to-end automatic speech recognition algorithms
Viaarxiv icon

Which one is more toxic? Findings from Jigsaw Rate Severity of Toxic Comments

Jun 27, 2022
Millon Madhur Das, Punyajoy Saha, Mithun Das

Figure 1 for Which one is more toxic? Findings from Jigsaw Rate Severity of Toxic Comments
Figure 2 for Which one is more toxic? Findings from Jigsaw Rate Severity of Toxic Comments
Viaarxiv icon

Continual Speaker Adaptation for Text-to-Speech Synthesis

Add code
Bookmark button
Alert button
Mar 26, 2021
Hamed Hemati, Damian Borth

Figure 1 for Continual Speaker Adaptation for Text-to-Speech Synthesis
Figure 2 for Continual Speaker Adaptation for Text-to-Speech Synthesis
Figure 3 for Continual Speaker Adaptation for Text-to-Speech Synthesis
Figure 4 for Continual Speaker Adaptation for Text-to-Speech Synthesis
Viaarxiv icon

Machine Learning based COVID-19 Detection from Smartphone Recordings: Cough, Breath and Speech

Apr 02, 2021
Madhurananda Pahar, Thomas Niesler

Figure 1 for Machine Learning based COVID-19 Detection from Smartphone Recordings: Cough, Breath and Speech
Figure 2 for Machine Learning based COVID-19 Detection from Smartphone Recordings: Cough, Breath and Speech
Figure 3 for Machine Learning based COVID-19 Detection from Smartphone Recordings: Cough, Breath and Speech
Figure 4 for Machine Learning based COVID-19 Detection from Smartphone Recordings: Cough, Breath and Speech
Viaarxiv icon

Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition

Jul 04, 2021
Tomohiro Tanaka, Ryo Masumura, Mana Ihori, Akihiko Takashima, Takafumi Moriya, Takanori Ashihara, Shota Orihashi, Naoki Makishima

Figure 1 for Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition
Figure 2 for Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition
Figure 3 for Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition
Figure 4 for Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition
Viaarxiv icon

IMS-Speech: A Speech to Text Tool

Add code
Bookmark button
Alert button
Aug 13, 2019
Pavel Denisov, Ngoc Thang Vu

Figure 1 for IMS-Speech: A Speech to Text Tool
Figure 2 for IMS-Speech: A Speech to Text Tool
Figure 3 for IMS-Speech: A Speech to Text Tool
Figure 4 for IMS-Speech: A Speech to Text Tool
Viaarxiv icon

Training end-to-end speech-to-text models on mobile phones

Dec 07, 2021
Zitha S, Raghavendra Rao Suresh, Pooja Rao, T. V. Prabhakar

Figure 1 for Training end-to-end speech-to-text models on mobile phones
Figure 2 for Training end-to-end speech-to-text models on mobile phones
Figure 3 for Training end-to-end speech-to-text models on mobile phones
Figure 4 for Training end-to-end speech-to-text models on mobile phones
Viaarxiv icon