Picture for Alfonso Ortega

Alfonso Ortega

Audio-Visual Speaker Diarization: Current Databases, Approaches and Challenges

Add code
Sep 09, 2024
Viaarxiv icon

Defining and Measuring Disentanglement for non-Independent Factors of Variation

Add code
Aug 13, 2024
Viaarxiv icon

Predefined Prototypes for Intra-Class Separation and Disentanglement

Add code
Jun 23, 2024
Viaarxiv icon

Explainable by-design Audio Segmentation through Non-Negative Matrix Factorization and Probing

Add code
Jun 19, 2024
Viaarxiv icon

Unsupervised Multiple Domain Translation through Controlled Disentanglement in Variational Autoencoder

Add code
Jan 18, 2024
Viaarxiv icon

An Explainable Proxy Model for Multiabel Audio Segmentation

Add code
Jan 17, 2024
Viaarxiv icon

Improved Vocal Effort Transfer Vector Estimation for Vocal Effort-Robust Speaker Verification

Add code
May 03, 2023
Viaarxiv icon

Human-Centric Multimodal Machine Learning: Recent Advances and Testbed on AI-based Recruitment

Add code
Feb 13, 2023
Viaarxiv icon

Class Token and Knowledge Distillation for Multi-head Self-Attention Speaker Verification Systems

Add code
Nov 06, 2021
Figure 1 for Class Token and Knowledge Distillation for Multi-head Self-Attention Speaker Verification Systems
Figure 2 for Class Token and Knowledge Distillation for Multi-head Self-Attention Speaker Verification Systems
Figure 3 for Class Token and Knowledge Distillation for Multi-head Self-Attention Speaker Verification Systems
Figure 4 for Class Token and Knowledge Distillation for Multi-head Self-Attention Speaker Verification Systems
Viaarxiv icon

Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data

Add code
Oct 27, 2021
Figure 1 for Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data
Figure 2 for Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data
Figure 3 for Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data
Viaarxiv icon