Jesús Villalba

Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification

Feb 29, 2024
Sonal Joshi, Thomas Thebaud, Jesús Villalba, Najim Dehak

Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning

Sep 08, 2023
Saurabhchand Bhati, Jesús Villalba, Laureano Moro-Velazquez, Thomas Thebaud, Najim Dehak

Regularizing Contrastive Predictive Coding for Speech Applications

Apr 26, 2023
Saurabhchand Bhati, Jesús Villalba, Piotr Żelasko, Laureano Moro-Velazquez, Najim Dehak

Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition

Mar 07, 2023
Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak

Time-domain speech super-resolution with GAN based modeling for telephony speaker verification

Sep 04, 2022
Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Piotr Żelasko, Najim Dehak

Non-Contrastive Self-Supervised Learning of Utterance-Level Speech Representations

Aug 10, 2022
Jaejin Cho, Raghavendra Pappagari, Piotr Żelasko, Laureano Moro-Velazquez, Jesús Villalba, Najim Dehak

Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification

Mar 30, 2022
Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak

Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding

Oct 08, 2021
Saurabhchand Bhati, Jesús Villalba, Piotr Żelasko, Laureano Moro-Velazquez, Najim Dehak

Beyond Isolated Utterances: Conversational Emotion Recognition

Sep 13, 2021
Raghavendra Pappagari, Piotr Żelasko, Jesús Villalba, Laureano Moro-Velazquez, Najim Dehak
