Sanjeev Khudanpur

GenVC: Self-Supervised Zero-Shot Voice Conversion

Feb 06, 2025

DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition

Dec 30, 2024

HLTCOE JHU Submission to the Voice Privacy Challenge 2024

Sep 17, 2024

Target Speaker ASR with Whisper

Sep 14, 2024

Clean Label Attacks against SLU Systems

Sep 13, 2024

Privacy versus Emotion Preservation Trade-offs in Emotion-Preserving Speaker Anonymization

Sep 05, 2024

Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation

Jul 14, 2024

Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment

Jun 17, 2024

Acoustic modeling for Overlapping Speech Recognition: JHU Chime-5 Challenge System

May 17, 2024

Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages

May 08, 2024