Alert button
Picture for Alexander Waibel

Alexander Waibel

Alert button

From Text Segmentation to Smart Chaptering: A Novel Benchmark for Structuring Video Transcriptions

Feb 27, 2024
Fabian Retkowski, Alexander Waibel

Viaarxiv icon

Continuously Learning New Words in Automatic Speech Recognition

Jan 09, 2024
Christian Huber, Alexander Waibel

Viaarxiv icon

Convoifilter: A case study of doing cocktail party speech recognition

Aug 22, 2023
Thai-Binh Nguyen, Alexander Waibel

Figure 1 for Convoifilter: A case study of doing cocktail party speech recognition
Viaarxiv icon

End-to-End Evaluation for Low-Latency Simultaneous Speech Translation

Aug 07, 2023
Christian Huber, Tu Anh Dinh, Carlos Mullov, Ngoc Quan Pham, Thai Binh Nguyen, Fabian Retkowski, Stefan Constantin, Enes Yavuz Ugan, Danni Liu, Zhaolin Li, Sai Koneru, Jan Niehues, Alexander Waibel

Figure 1 for End-to-End Evaluation for Low-Latency Simultaneous Speech Translation
Figure 2 for End-to-End Evaluation for Low-Latency Simultaneous Speech Translation
Figure 3 for End-to-End Evaluation for Low-Latency Simultaneous Speech Translation
Figure 4 for End-to-End Evaluation for Low-Latency Simultaneous Speech Translation
Viaarxiv icon

Plug the Leaks: Advancing Audio-driven Talking Face Generation by Preventing Unintended Information Flow

Jul 18, 2023
Dogucan Yaman, Fevziye Irem Eyiokur, Leonard Bärmann, Hazim Kemal Ekenel, Alexander Waibel

Figure 1 for Plug the Leaks: Advancing Audio-driven Talking Face Generation by Preventing Unintended Information Flow
Figure 2 for Plug the Leaks: Advancing Audio-driven Talking Face Generation by Preventing Unintended Information Flow
Figure 3 for Plug the Leaks: Advancing Audio-driven Talking Face Generation by Preventing Unintended Information Flow
Figure 4 for Plug the Leaks: Advancing Audio-driven Talking Face Generation by Preventing Unintended Information Flow
Viaarxiv icon

KIT's Multilingual Speech Translation System for IWSLT 2023

Jun 15, 2023
Danni Liu, Thai Binh Nguyen, Sai Koneru, Enes Yavuz Ugan, Ngoc-Quan Pham, Tuan-Nam Nguyen, Tu Anh Dinh, Carlos Mullov, Alexander Waibel, Jan Niehues

Figure 1 for KIT's Multilingual Speech Translation System for IWSLT 2023
Figure 2 for KIT's Multilingual Speech Translation System for IWSLT 2023
Figure 3 for KIT's Multilingual Speech Translation System for IWSLT 2023
Figure 4 for KIT's Multilingual Speech Translation System for IWSLT 2023
Viaarxiv icon

Continually learning new languages

Nov 21, 2022
Ngoc-Quan Pham, Jan Niehues, Alexander Waibel

Figure 1 for Continually learning new languages
Figure 2 for Continually learning new languages
Viaarxiv icon

A Survey on Computer Vision based Human Analysis in the COVID-19 Era

Nov 07, 2022
Fevziye Irem Eyiokur, Alperen Kantarcı, Mustafa Ekrem Erakın, Naser Damer, Ferda Ofli, Muhammad Imran, Janez Križaj, Albert Ali Salah, Alexander Waibel, Vitomir Štruc, Hazım Kemal Ekenel

Figure 1 for A Survey on Computer Vision based Human Analysis in the COVID-19 Era
Figure 2 for A Survey on Computer Vision based Human Analysis in the COVID-19 Era
Figure 3 for A Survey on Computer Vision based Human Analysis in the COVID-19 Era
Figure 4 for A Survey on Computer Vision based Human Analysis in the COVID-19 Era
Viaarxiv icon

Language-agnostic Code-Switching in End-To-End Speech Recognition

Oct 17, 2022
Enes Yavuz Ugan, Christian Huber, Juan Hussain, Alexander Waibel

Figure 1 for Language-agnostic Code-Switching in End-To-End Speech Recognition
Figure 2 for Language-agnostic Code-Switching in End-To-End Speech Recognition
Figure 3 for Language-agnostic Code-Switching in End-To-End Speech Recognition
Figure 4 for Language-agnostic Code-Switching in End-To-End Speech Recognition
Viaarxiv icon