Ignacio Lopez Moreno

Locale Encoding For Scalable Multilingual Keyword Spotting Models

Feb 25, 2023
Pai Zhu, Hyun Jin Park, Alex Park, Angelo Scorza Scarpati, Ignacio Lopez Moreno


Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss

Nov 11, 2022
Guanlong Zhao, Quan Wang, Han Lu, Yiling Huang, Ignacio Lopez Moreno


Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering

Oct 25, 2022
Quan Wang, Yiling Huang, Han Lu, Guanlong Zhao, Ignacio Lopez Moreno


Production federated keyword spotting via distillation, filtering, and joint federated-centralized training

Apr 11, 2022
Andrew Hard, Kurt Partridge, Neng Chen, Sean Augenstein, Aishanee Shah, Hyun Jin Park, Alex Park, Sara Ng, Jessica Nguyen, Ignacio Lopez Moreno, Rajiv Mathews, Françoise Beaufays


Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech

Mar 21, 2022
Quan Wang, Yang Yu, Jason Pelecanos, Yiling Huang, Ignacio Lopez Moreno


Parameter-Free Attentive Scoring for Speaker Verification

Mar 10, 2022
Jason Pelecanos, Quan Wang, Yiling Huang, Ignacio Lopez Moreno


Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection

Oct 05, 2021
Wei Xia, Han Lu, Quan Wang, Anshuman Tripathi, Yiling Huang, Ignacio Lopez Moreno, Hasim Sak


Noisy student-teacher training for robust keyword spotting

Jun 03, 2021
Hyun-Jin Park, Pai Zhu, Ignacio Lopez Moreno, Niranjan Subrahmanya
