Alert button
Picture for Iván Vallés-Pérez

Iván Vallés-Pérez

Alert button

Enhancing the Stability of LLM-based Speech Generation Systems through Self-Supervised Representations

Add code
Bookmark button
Alert button
Feb 05, 2024
Álvaro Martín-Cortinas, Daniel Sáez-Trigueros, Iván Vallés-Pérez, Biel Tura-Vecino, Piotr Biliński, Mateusz Lajszczak, Grzegorz Beringer, Roberto Barra-Chicote, Jaime Lorenzo-Trueba

Viaarxiv icon

Empirical study of the modulus as activation function in computer vision applications

Add code
Bookmark button
Alert button
Jan 15, 2023
Iván Vallés-Pérez, Emilio Soria-Olivas, Marcelino Martínez-Sober, Antonio J. Serrano-López, Joan Vila-Francés, Juan Gómez-Sanchís

Figure 1 for Empirical study of the modulus as activation function in computer vision applications
Figure 2 for Empirical study of the modulus as activation function in computer vision applications
Figure 3 for Empirical study of the modulus as activation function in computer vision applications
Figure 4 for Empirical study of the modulus as activation function in computer vision applications
Viaarxiv icon

Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech

Add code
Bookmark button
Alert button
Nov 04, 2022
Xin Zhang, Iván Vallés-Pérez, Andreas Stolcke, Chengzhu Yu, Jasha Droppo, Olabanji Shonibare, Roberto Barra-Chicote, Venkatesh Ravichandran

Figure 1 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Figure 2 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Figure 3 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Figure 4 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Viaarxiv icon

Approaching sales forecasting using recurrent neural networks and transformers

Add code
Bookmark button
Alert button
Apr 16, 2022
Iván Vallés-Pérez, Emilio Soria-Olivas, Marcelino Martínez-Sober, Antonio J. Serrano-López, Juan Gómez-Sanchís, Fernando Mateo

Figure 1 for Approaching sales forecasting using recurrent neural networks and transformers
Figure 2 for Approaching sales forecasting using recurrent neural networks and transformers
Figure 3 for Approaching sales forecasting using recurrent neural networks and transformers
Figure 4 for Approaching sales forecasting using recurrent neural networks and transformers
Viaarxiv icon

End-to-end Keyword Spotting using Xception-1d

Add code
Bookmark button
Alert button
Oct 09, 2021
Iván Vallés-Pérez, Juan Gómez-Sanchis, Marcelino Martínez-Sober, Joan Vila-Francés, Antonio J. Serrano-López, Emilio Soria-Olivas

Figure 1 for End-to-end Keyword Spotting using Xception-1d
Figure 2 for End-to-end Keyword Spotting using Xception-1d
Figure 3 for End-to-end Keyword Spotting using Xception-1d
Viaarxiv icon

Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows

Add code
Bookmark button
Alert button
Jun 10, 2021
Iván Vallés-Pérez, Julian Roth, Grzegorz Beringer, Roberto Barra-Chicote, Jasha Droppo

Figure 1 for Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Figure 2 for Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Figure 3 for Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Figure 4 for Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Viaarxiv icon