Picture for Jason Pelecanos

Jason Pelecanos

USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models

Add code
Sep 14, 2023
Viaarxiv icon

Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech

Add code
Mar 21, 2022
Figure 1 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Figure 2 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Figure 3 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Figure 4 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Viaarxiv icon

Parameter-Free Attentive Scoring for Speaker Verification

Add code
Mar 10, 2022
Figure 1 for Parameter-Free Attentive Scoring for Speaker Verification
Figure 2 for Parameter-Free Attentive Scoring for Speaker Verification
Figure 3 for Parameter-Free Attentive Scoring for Speaker Verification
Figure 4 for Parameter-Free Attentive Scoring for Speaker Verification
Viaarxiv icon

SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System

Add code
Apr 26, 2021
Figure 1 for SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System
Figure 2 for SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System
Figure 3 for SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System
Figure 4 for SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System
Viaarxiv icon

Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition

Add code
Apr 05, 2021
Figure 1 for Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition
Figure 2 for Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition
Figure 3 for Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition
Figure 4 for Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition
Viaarxiv icon

Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech

Add code
Nov 24, 2020
Figure 1 for Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech
Figure 2 for Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech
Figure 3 for Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech
Figure 4 for Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech
Viaarxiv icon

VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition

Add code
Sep 09, 2020
Figure 1 for VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
Figure 2 for VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
Figure 3 for VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
Viaarxiv icon

The IBM Speaker Recognition System: Recent Advances and Error Analysis

Add code
May 05, 2016
Figure 1 for The IBM Speaker Recognition System: Recent Advances and Error Analysis
Figure 2 for The IBM Speaker Recognition System: Recent Advances and Error Analysis
Figure 3 for The IBM Speaker Recognition System: Recent Advances and Error Analysis
Figure 4 for The IBM Speaker Recognition System: Recent Advances and Error Analysis
Viaarxiv icon