Alert button
Picture for Edresson Casanova

Edresson Casanova

Alert button

MLAAD: The Multi-Language Audio Anti-Spoofing Dataset

Add code
Bookmark button
Alert button
Jan 17, 2024
Nicolas M. Müller, Piotr Kawa, Wei Herng Choong, Edresson Casanova, Eren Gölge, Thorsten Müller, Piotr Syga, Philip Sperl, Konstantin Böttinger

Viaarxiv icon

Evaluation of Speech Representations for MOS prediction

Add code
Bookmark button
Alert button
Jun 16, 2023
Frederico S. Oliveira, Edresson Casanova, Arnaldo Cândido Júnior, Lucas R. S. Gris, Anderson S. Soares, Arlindo R. Galvão Filho

Figure 1 for Evaluation of Speech Representations for MOS prediction
Figure 2 for Evaluation of Speech Representations for MOS prediction
Figure 3 for Evaluation of Speech Representations for MOS prediction
Figure 4 for Evaluation of Speech Representations for MOS prediction
Viaarxiv icon

CML-TTS A Multilingual Dataset for Speech Synthesis in Low-Resource Languages

Add code
Bookmark button
Alert button
Jun 16, 2023
Frederico S. Oliveira, Edresson Casanova, Arnaldo Cândido Júnior, Anderson S. Soares, Arlindo R. Galvão Filho

Viaarxiv icon

Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person

Add code
Bookmark button
Alert button
May 26, 2023
Lucas Rafael Stefanel Gris, Ricardo Marcacini, Arnaldo Candido Junior, Edresson Casanova, Anderson Soares, Sandra Maria Aluísio

Figure 1 for Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person
Figure 2 for Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person
Viaarxiv icon

Interpretability Analysis of Deep Models for COVID-19 Detection

Add code
Bookmark button
Alert button
Nov 25, 2022
Daniel Peixoto Pinto da Silva, Edresson Casanova, Lucas Rafael Stefanel Gris, Arnaldo Candido Junior, Marcelo Finger, Flaviane Svartman, Beatriz Raposo, Marcus Vinícius Moreira Martins, Sandra Maria Aluísio, Larissa Cristina Berti, João Paulo Teixeira

Figure 1 for Interpretability Analysis of Deep Models for COVID-19 Detection
Figure 2 for Interpretability Analysis of Deep Models for COVID-19 Detection
Figure 3 for Interpretability Analysis of Deep Models for COVID-19 Detection
Figure 4 for Interpretability Analysis of Deep Models for COVID-19 Detection
Viaarxiv icon

BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus

Add code
Bookmark button
Alert button
Jul 07, 2022
Josh Meyer, David Ifeoluwa Adelani, Edresson Casanova, Alp Öktem, Daniel Whitenack Julian Weber, Salomon Kabongo, Elizabeth Salesky, Iroro Orife, Colin Leong, Perez Ogayo, Chris Emezue, Jonathan Mukiibi, Salomey Osei, Apelete Agbolo, Victor Akinode, Bernard Opoku, Samuel Olanrewaju, Jesujoba Alabi, Shamsuddeen Muhammad

Figure 1 for BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Figure 2 for BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Figure 3 for BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Figure 4 for BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Viaarxiv icon

A single speaker is almost all you need for automatic speech recognition

Add code
Bookmark button
Alert button
Mar 29, 2022
Edresson Casanova, Christopher Shulby, Alexander Korolev, Arnaldo Candido Junior, Anderson da Silva Soares, Sandra Aluísio, Moacir Antonelli Ponti

Figure 1 for A single speaker is almost all you need for automatic speech recognition
Figure 2 for A single speaker is almost all you need for automatic speech recognition
Figure 3 for A single speaker is almost all you need for automatic speech recognition
Viaarxiv icon

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Add code
Bookmark button
Alert button
Dec 04, 2021
Edresson Casanova, Julian Weber, Christopher Shulby, Arnaldo Candido Junior, Eren Gölge, Moacir Antonelli Ponti

Figure 1 for YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Figure 2 for YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Figure 3 for YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Figure 4 for YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Viaarxiv icon

CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese

Add code
Bookmark button
Alert button
Oct 14, 2021
Arnaldo Candido Junior, Edresson Casanova, Anderson Soares, Frederico Santos de Oliveira, Lucas Oliveira, Ricardo Corso Fernandes Junior, Daniel Peixoto Pinto da Silva, Fernando Gorgulho Fayet, Bruno Baldissera Carlotto, Lucas Rafael Stefanel Gris, Sandra Maria Aluísio

Figure 1 for CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese
Figure 2 for CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese
Figure 3 for CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese
Figure 4 for CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese
Viaarxiv icon