Picture for Alexander Waibel

Alexander Waibel

Towards Better Disentanglement in Non-Autoregressive Zero-Shot Expressive Voice Conversion

Add code
Jun 04, 2025
Viaarxiv icon

KIT's Low-resource Speech Translation Systems for IWSLT2025: System Enhancement with Synthetic Data and Model Regularization

Add code
May 26, 2025
Viaarxiv icon

KIT's Offline Speech Translation and Instruction Following Submission for IWSLT 2025

Add code
May 19, 2025
Viaarxiv icon

The AI Co-Ethnographer: How Far Can Automation Take Qualitative Research?

Add code
Apr 21, 2025
Viaarxiv icon

From Speech to Summary: A Comprehensive Survey of Speech Summarization

Add code
Apr 10, 2025
Viaarxiv icon

Zero-Shot Strategies for Length-Controllable Summarization

Add code
Dec 31, 2024
Viaarxiv icon

MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models

Add code
Nov 27, 2024
Figure 1 for MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models
Figure 2 for MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models
Figure 3 for MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models
Figure 4 for MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models
Viaarxiv icon

Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS

Add code
Oct 19, 2024
Viaarxiv icon

Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck

Add code
Oct 15, 2024
Figure 1 for Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck
Figure 2 for Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck
Figure 3 for Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck
Figure 4 for Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck
Viaarxiv icon

Predictive Speech Recognition and End-of-Utterance Detection Towards Spoken Dialog Systems

Add code
Sep 30, 2024
Figure 1 for Predictive Speech Recognition and End-of-Utterance Detection Towards Spoken Dialog Systems
Figure 2 for Predictive Speech Recognition and End-of-Utterance Detection Towards Spoken Dialog Systems
Figure 3 for Predictive Speech Recognition and End-of-Utterance Detection Towards Spoken Dialog Systems
Figure 4 for Predictive Speech Recognition and End-of-Utterance Detection Towards Spoken Dialog Systems
Viaarxiv icon