Alert button

"speech": models, code, and papers
Alert button

A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition

Jul 03, 2022
Ying Hu, Yuwu Tang, Hao Huang, Liang He

Figure 1 for A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition
Figure 2 for A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition
Figure 3 for A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition
Figure 4 for A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition
Viaarxiv icon

Neural Speech Synthesis on a Shoestring: Improving the Efficiency of LPCNet

Add code
Bookmark button
Alert button
Feb 22, 2022
Jean-Marc Valin, Umut Isik, Paris Smaragdis, Arvindh Krishnaswamy

Figure 1 for Neural Speech Synthesis on a Shoestring: Improving the Efficiency of LPCNet
Figure 2 for Neural Speech Synthesis on a Shoestring: Improving the Efficiency of LPCNet
Figure 3 for Neural Speech Synthesis on a Shoestring: Improving the Efficiency of LPCNet
Figure 4 for Neural Speech Synthesis on a Shoestring: Improving the Efficiency of LPCNet
Viaarxiv icon

Shennong: a Python toolbox for audio speech features extraction

Add code
Bookmark button
Alert button
Dec 10, 2021
Mathieu Bernard, Maxime Poli, Julien Karadayi, Emmanuel Dupoux

Figure 1 for Shennong: a Python toolbox for audio speech features extraction
Figure 2 for Shennong: a Python toolbox for audio speech features extraction
Figure 3 for Shennong: a Python toolbox for audio speech features extraction
Figure 4 for Shennong: a Python toolbox for audio speech features extraction
Viaarxiv icon

Real-time Speech Emotion Recognition Based on Syllable-Level Feature Extraction

Apr 26, 2022
Abdul Rehman, Zhen-Tao Liu, Min Wu, Wei-Hua Cao, Cheng-Shan Jiang

Figure 1 for Real-time Speech Emotion Recognition Based on Syllable-Level Feature Extraction
Figure 2 for Real-time Speech Emotion Recognition Based on Syllable-Level Feature Extraction
Figure 3 for Real-time Speech Emotion Recognition Based on Syllable-Level Feature Extraction
Figure 4 for Real-time Speech Emotion Recognition Based on Syllable-Level Feature Extraction
Viaarxiv icon

A combined approach to the analysis of speech conversations in a contact center domain

Add code
Bookmark button
Alert button
Mar 12, 2022
Andrea Brunello, Enrico Marzano, Angelo Montanari, Guido Sciavicco

Figure 1 for A combined approach to the analysis of speech conversations in a contact center domain
Figure 2 for A combined approach to the analysis of speech conversations in a contact center domain
Figure 3 for A combined approach to the analysis of speech conversations in a contact center domain
Figure 4 for A combined approach to the analysis of speech conversations in a contact center domain
Viaarxiv icon

COVYT: Introducing the Coronavirus YouTube and TikTok speech dataset featuring the same speakers with and without infection

Jun 20, 2022
Andreas Triantafyllopoulos, Anastasia Semertzidou, Meishu Song, Florian B. Pokorny, Björn W. Schuller

Figure 1 for COVYT: Introducing the Coronavirus YouTube and TikTok speech dataset featuring the same speakers with and without infection
Figure 2 for COVYT: Introducing the Coronavirus YouTube and TikTok speech dataset featuring the same speakers with and without infection
Figure 3 for COVYT: Introducing the Coronavirus YouTube and TikTok speech dataset featuring the same speakers with and without infection
Figure 4 for COVYT: Introducing the Coronavirus YouTube and TikTok speech dataset featuring the same speakers with and without infection
Viaarxiv icon

Listen, denoise, action! Audio-driven motion synthesis with diffusion models

Add code
Bookmark button
Alert button
Nov 17, 2022
Simon Alexanderson, Rajmund Nagy, Jonas Beskow, Gustav Eje Henter

Figure 1 for Listen, denoise, action! Audio-driven motion synthesis with diffusion models
Figure 2 for Listen, denoise, action! Audio-driven motion synthesis with diffusion models
Figure 3 for Listen, denoise, action! Audio-driven motion synthesis with diffusion models
Figure 4 for Listen, denoise, action! Audio-driven motion synthesis with diffusion models
Viaarxiv icon

Data Augmentation for Speech Recognition in Maltese: A Low-Resource Perspective

Add code
Bookmark button
Alert button
Nov 15, 2021
Carlos Mena, Andrea DeMarco, Claudia Borg, Lonneke van der Plas, Albert Gatt

Figure 1 for Data Augmentation for Speech Recognition in Maltese: A Low-Resource Perspective
Figure 2 for Data Augmentation for Speech Recognition in Maltese: A Low-Resource Perspective
Figure 3 for Data Augmentation for Speech Recognition in Maltese: A Low-Resource Perspective
Figure 4 for Data Augmentation for Speech Recognition in Maltese: A Low-Resource Perspective
Viaarxiv icon

FullStop:Punctuation and Segmentation Prediction for Dutch with Transformers

Add code
Bookmark button
Alert button
Jan 09, 2023
Vincent Vandeghinste, Oliver Guhr

Figure 1 for FullStop:Punctuation and Segmentation Prediction for Dutch with Transformers
Figure 2 for FullStop:Punctuation and Segmentation Prediction for Dutch with Transformers
Figure 3 for FullStop:Punctuation and Segmentation Prediction for Dutch with Transformers
Figure 4 for FullStop:Punctuation and Segmentation Prediction for Dutch with Transformers
Viaarxiv icon

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

Add code
Bookmark button
Alert button
Mar 29, 2022
Rui Wang, Qibing Bai, Junyi Ao, Long Zhou, Zhixiang Xiong, Zhihua Wei, Yu Zhang, Tom Ko, Haizhou Li

Figure 1 for LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Figure 2 for LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Figure 3 for LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Figure 4 for LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Viaarxiv icon