Matthew Wiesner

Target Speaker ASR with Whisper

Sep 14, 2024

Privacy versus Emotion Preservation Trade-offs in Emotion-Preserving Speaker Anonymization

Sep 05, 2024

The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization

Jul 23, 2024

On Speaker Attribution with SURT

Jan 28, 2024

Speech collage: code-switched audio generation by collaging monolingual corpora

Sep 27, 2023

The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios

Jul 14, 2023

HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation

Jun 20, 2023

Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts

Jun 01, 2023

Towards Zero-Shot Code-Switched Speech Recognition

Nov 09, 2022

Injecting Text and Cross-lingual Supervision in Few-shot Learning from Self-Supervised Models

Oct 10, 2021