Alert button
Picture for Grzegorz Beringer

Grzegorz Beringer

Alert button

Enhancing the Stability of LLM-based Speech Generation Systems through Self-Supervised Representations

Add code
Bookmark button
Alert button
Feb 05, 2024
Álvaro Martín-Cortinas, Daniel Sáez-Trigueros, Iván Vallés-Pérez, Biel Tura-Vecino, Piotr Biliński, Mateusz Lajszczak, Grzegorz Beringer, Roberto Barra-Chicote, Jaime Lorenzo-Trueba

Viaarxiv icon

SCRAPS: Speech Contrastive Representations of Acoustic and Phonetic Spaces

Add code
Bookmark button
Alert button
Jul 23, 2023
Ivan Vallés-Pérez, Grzegorz Beringer, Piotr Bilinski, Gary Cook, Roberto Barra-Chicote

Figure 1 for SCRAPS: Speech Contrastive Representations of Acoustic and Phonetic Spaces
Figure 2 for SCRAPS: Speech Contrastive Representations of Acoustic and Phonetic Spaces
Figure 3 for SCRAPS: Speech Contrastive Representations of Acoustic and Phonetic Spaces
Figure 4 for SCRAPS: Speech Contrastive Representations of Acoustic and Phonetic Spaces
Viaarxiv icon

GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion

Add code
Bookmark button
Alert button
Jul 04, 2022
Magdalena Proszewska, Grzegorz Beringer, Daniel Sáez-Trigueros, Thomas Merritt, Abdelhamid Ezzerg, Roberto Barra-Chicote

Figure 1 for GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion
Figure 2 for GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion
Figure 3 for GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion
Figure 4 for GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion
Viaarxiv icon

Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows

Add code
Bookmark button
Alert button
Jun 10, 2021
Iván Vallés-Pérez, Julian Roth, Grzegorz Beringer, Roberto Barra-Chicote, Jasha Droppo

Figure 1 for Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Figure 2 for Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Figure 3 for Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Figure 4 for Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Viaarxiv icon

Detection of Lexical Stress Errors in Non-native (L2) English with Data Augmentation and Attention

Add code
Bookmark button
Alert button
Dec 29, 2020
Daniel Korzekwa, Roberto Barra-Chicote, Szymon Zaporowski, Grzegorz Beringer, Jaime Lorenzo-Trueba, Alicja Serafinowicz, Jasha Droppo, Thomas Drugman, Bozena Kostek

Figure 1 for Detection of Lexical Stress Errors in Non-native (L2) English with Data Augmentation and Attention
Figure 2 for Detection of Lexical Stress Errors in Non-native (L2) English with Data Augmentation and Attention
Figure 3 for Detection of Lexical Stress Errors in Non-native (L2) English with Data Augmentation and Attention
Figure 4 for Detection of Lexical Stress Errors in Non-native (L2) English with Data Augmentation and Attention
Viaarxiv icon