Tamás Grósz

Advancing Audio Emotion and Intent Recognition with Large Pre-Trained Models and Bayesian Inference

Oct 16, 2023

Topic Identification For Spontaneous Speech: Enriching Audio Features With Embedded Linguistic Information

Jul 21, 2023

End-to-end Ensemble-based Feature Selection for Paralinguistics Tasks

Oct 28, 2022

Comparison and Analysis of New Curriculum Criteria for End-to-End ASR

Aug 10, 2022

Lahjoita puhetta -- a large-scale corpus of spoken Finnish with some benchmarks

Mar 24, 2022

Data augmentation using prosody and false starts to recognize non-native children's speech

Aug 29, 2020

Aalto's End-to-End DNN systems for the INTERSPEECH 2020 Computational Paralinguistics Challenge

Aug 06, 2020

GMM-Free Flat Start Sequence-Discriminative DNN Training

Oct 11, 2016