Berrak Sisman

emoDARTS: Joint Optimisation of CNN & Sequential Neural Network Architectures for Superior Speech Emotion Recognition

Mar 21, 2024
Thejan Rajapakshe, Rajib Rana, Sara Khalifa, Berrak Sisman, Bjorn W. Schuller, Carlos Busso

Revealing Emotional Clusters in Speaker Embeddings: A Contrastive Learning Strategy for Speech Emotion Recognition

Jan 19, 2024
Ismail Rasim Ulgen, Zongyang Du, Carlos Busso, Berrak Sisman

High-Quality Automatic Voice Over with Accurate Alignment: Supervision through Self-Supervised Discrete Speech Units

Jun 29, 2023
Junchen Lu, Berrak Sisman, Mingyang Zhang, Haizhou Li

Improving Speech Emotion Recognition Performance using Differentiable Architecture Search

May 23, 2023
Thejan Rajapakshe, Rajib Rana, Sara Khalifa, Berrak Sisman, Björn Schuller

Versatile Audio-Visual Learning for Handling Single and Multi Modalities in Emotion Regression and Classification Tasks

May 12, 2023
Lucas Goncalves, Seong-Gyun Leem, Wei-Cheng Lin, Berrak Sisman, Carlos Busso

SNIPER Training: Variable Sparsity Rate Training For Text-To-Speech

Nov 14, 2022
Perry Lam, Huayun Zhang, Nancy F. Chen, Berrak Sisman, Dorien Herremans

Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder

Nov 07, 2022
Jan Melechovsky, Ambuj Mehrish, Berrak Sisman, Dorien Herremans

Mixed Emotion Modelling for Emotional Voice Conversion

Oct 26, 2022
Kun Zhou, Berrak Sisman, Carlos Busso, Haizhou Li

EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models

Sep 22, 2022
Perry Lam, Huayun Zhang, Nancy F. Chen, Berrak Sisman

Controllable Accented Text-to-Speech Synthesis

Sep 22, 2022
Rui Liu, Berrak Sisman, Guanglai Gao, Haizhou Li
