Picture for Ignacio Lopez Moreno

Ignacio Lopez Moreno

Locale Encoding For Scalable Multilingual Keyword Spotting Models

Add code
Feb 25, 2023
Figure 1 for Locale Encoding For Scalable Multilingual Keyword Spotting Models
Figure 2 for Locale Encoding For Scalable Multilingual Keyword Spotting Models
Figure 3 for Locale Encoding For Scalable Multilingual Keyword Spotting Models
Figure 4 for Locale Encoding For Scalable Multilingual Keyword Spotting Models
Viaarxiv icon

Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss

Add code
Nov 11, 2022
Figure 1 for Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss
Figure 2 for Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss
Figure 3 for Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss
Figure 4 for Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss
Viaarxiv icon

Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering

Add code
Oct 25, 2022
Figure 1 for Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering
Figure 2 for Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering
Figure 3 for Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering
Figure 4 for Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering
Viaarxiv icon

Production federated keyword spotting via distillation, filtering, and joint federated-centralized training

Add code
Apr 11, 2022
Figure 1 for Production federated keyword spotting via distillation, filtering, and joint federated-centralized training
Figure 2 for Production federated keyword spotting via distillation, filtering, and joint federated-centralized training
Figure 3 for Production federated keyword spotting via distillation, filtering, and joint federated-centralized training
Figure 4 for Production federated keyword spotting via distillation, filtering, and joint federated-centralized training
Viaarxiv icon

Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech

Add code
Mar 21, 2022
Figure 1 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Figure 2 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Figure 3 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Figure 4 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Viaarxiv icon

Parameter-Free Attentive Scoring for Speaker Verification

Add code
Mar 10, 2022
Figure 1 for Parameter-Free Attentive Scoring for Speaker Verification
Figure 2 for Parameter-Free Attentive Scoring for Speaker Verification
Figure 3 for Parameter-Free Attentive Scoring for Speaker Verification
Figure 4 for Parameter-Free Attentive Scoring for Speaker Verification
Viaarxiv icon

Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection

Add code
Oct 05, 2021
Figure 1 for Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection
Figure 2 for Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection
Figure 3 for Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection
Figure 4 for Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection
Viaarxiv icon

Noisy student-teacher training for robust keyword spotting

Add code
Jun 03, 2021
Figure 1 for Noisy student-teacher training for robust keyword spotting
Figure 2 for Noisy student-teacher training for robust keyword spotting
Figure 3 for Noisy student-teacher training for robust keyword spotting
Figure 4 for Noisy student-teacher training for robust keyword spotting
Viaarxiv icon

SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System

Add code
Apr 26, 2021
Figure 1 for SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System
Figure 2 for SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System
Figure 3 for SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System
Figure 4 for SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System
Viaarxiv icon

Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition

Add code
Apr 05, 2021
Figure 1 for Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition
Figure 2 for Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition
Figure 3 for Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition
Figure 4 for Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition
Viaarxiv icon