Alert button
Picture for Adriana Stan

Adriana Stan

Alert button

An analysis of large speech models-based representations for speech emotion recognition

Add code
Bookmark button
Alert button
Nov 01, 2023
Adrian Bogdan Stânea, Vlad Striletchi, Cosmin Striletchi, Adriana Stan

Viaarxiv icon

Towards generalisable and calibrated synthetic speech detection with self-supervised representations

Add code
Bookmark button
Alert button
Sep 11, 2023
Dan Oneata, Adriana Stan, Octavian Pascu, Elisabeta Oneata, Horia Cucu

Figure 1 for Towards generalisable and calibrated synthetic speech detection with self-supervised representations
Figure 2 for Towards generalisable and calibrated synthetic speech detection with self-supervised representations
Figure 3 for Towards generalisable and calibrated synthetic speech detection with self-supervised representations
Figure 4 for Towards generalisable and calibrated synthetic speech detection with self-supervised representations
Viaarxiv icon

An analysis on the effects of speaker embedding choice in non auto-regressive TTS

Add code
Bookmark button
Alert button
Jul 19, 2023
Adriana Stan, Johannah O'Mahony

Figure 1 for An analysis on the effects of speaker embedding choice in non auto-regressive TTS
Figure 2 for An analysis on the effects of speaker embedding choice in non auto-regressive TTS
Figure 3 for An analysis on the effects of speaker embedding choice in non auto-regressive TTS
Figure 4 for An analysis on the effects of speaker embedding choice in non auto-regressive TTS
Viaarxiv icon

Residual Information in Deep Speaker Embedding Architectures

Add code
Bookmark button
Alert button
Feb 06, 2023
Adriana Stan

Figure 1 for Residual Information in Deep Speaker Embedding Architectures
Figure 2 for Residual Information in Deep Speaker Embedding Architectures
Figure 3 for Residual Information in Deep Speaker Embedding Architectures
Figure 4 for Residual Information in Deep Speaker Embedding Architectures
Viaarxiv icon

The ZevoMOS entry to VoiceMOS Challenge 2022

Add code
Bookmark button
Alert button
Jun 15, 2022
Adriana Stan

Figure 1 for The ZevoMOS entry to VoiceMOS Challenge 2022
Figure 2 for The ZevoMOS entry to VoiceMOS Challenge 2022
Figure 3 for The ZevoMOS entry to VoiceMOS Challenge 2022
Viaarxiv icon

FlexLip: A Controllable Text-to-Lip System

Add code
Bookmark button
Alert button
Jun 07, 2022
Dan Oneata, Beata Lorincz, Adriana Stan, Horia Cucu

Figure 1 for FlexLip: A Controllable Text-to-Lip System
Figure 2 for FlexLip: A Controllable Text-to-Lip System
Figure 3 for FlexLip: A Controllable Text-to-Lip System
Figure 4 for FlexLip: A Controllable Text-to-Lip System
Viaarxiv icon

An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis

Add code
Bookmark button
Alert button
Jun 03, 2021
Beata Lorincz, Adriana Stan, Mircea Giurgiu

Figure 1 for An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis
Figure 2 for An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis
Figure 3 for An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis
Figure 4 for An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis
Viaarxiv icon

Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis

Add code
Bookmark button
Alert button
Jun 03, 2021
Beata Lorincz, Adriana Stan, Mircea Giurgiu

Figure 1 for Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis
Figure 2 for Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis
Figure 3 for Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis
Figure 4 for Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis
Viaarxiv icon

Speaker disentanglement in video-to-speech conversion

Add code
Bookmark button
Alert button
May 20, 2021
Dan Oneata, Adriana Stan, Horia Cucu

Figure 1 for Speaker disentanglement in video-to-speech conversion
Figure 2 for Speaker disentanglement in video-to-speech conversion
Figure 3 for Speaker disentanglement in video-to-speech conversion
Figure 4 for Speaker disentanglement in video-to-speech conversion
Viaarxiv icon

An evaluation of word-level confidence estimation for end-to-end automatic speech recognition

Add code
Bookmark button
Alert button
Jan 14, 2021
Dan Oneata, Alexandru Caranica, Adriana Stan, Horia Cucu

Figure 1 for An evaluation of word-level confidence estimation for end-to-end automatic speech recognition
Figure 2 for An evaluation of word-level confidence estimation for end-to-end automatic speech recognition
Figure 3 for An evaluation of word-level confidence estimation for end-to-end automatic speech recognition
Figure 4 for An evaluation of word-level confidence estimation for end-to-end automatic speech recognition
Viaarxiv icon