Alert button
Picture for Lukas Drude

Lukas Drude

Alert button

Promptformer: Prompted Conformer Transducer for ASR

Add code
Bookmark button
Alert button
Jan 14, 2024
Sergio Duarte-Torres, Arunasish Sen, Aman Rana, Lukas Drude, Alejandro Gomez-Alanis, Andreas Schwarz, Leif Rädel, Volker Leutnant

Viaarxiv icon

Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Jun 12, 2023
Belen Alastruey, Lukas Drude, Jahn Heymann, Simon Wiesler

Figure 1 for Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition
Figure 2 for Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition
Figure 3 for Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition
Figure 4 for Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition
Viaarxiv icon

Contextual-Utterance Training for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Oct 27, 2022
Alejandro Gomez-Alanis, Lukas Drude, Andreas Schwarz, Rupak Vignesh Swaminathan, Simon Wiesler

Figure 1 for Contextual-Utterance Training for Automatic Speech Recognition
Figure 2 for Contextual-Utterance Training for Automatic Speech Recognition
Figure 3 for Contextual-Utterance Training for Automatic Speech Recognition
Figure 4 for Contextual-Utterance Training for Automatic Speech Recognition
Viaarxiv icon

Multi-channel Opus compression for far-field automatic speech recognition with a fixed bitrate budget

Add code
Bookmark button
Alert button
Jun 15, 2021
Lukas Drude, Jahn Heymann, Andreas Schwarz, Jean-Marc Valin

Figure 1 for Multi-channel Opus compression for far-field automatic speech recognition with a fixed bitrate budget
Figure 2 for Multi-channel Opus compression for far-field automatic speech recognition with a fixed bitrate budget
Figure 3 for Multi-channel Opus compression for far-field automatic speech recognition with a fixed bitrate budget
Figure 4 for Multi-channel Opus compression for far-field automatic speech recognition with a fixed bitrate budget
Viaarxiv icon

Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR

Add code
Bookmark button
Alert button
Jun 04, 2020
Thilo von Neumann, Christoph Boeddeker, Lukas Drude, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach

Figure 1 for Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR
Figure 2 for Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR
Figure 3 for Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR
Figure 4 for Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR
Viaarxiv icon

End-to-end training of time domain audio separation and recognition

Add code
Bookmark button
Alert button
Dec 25, 2019
Thilo von Neumann, Keisuke Kinoshita, Lukas Drude, Christoph Boeddeker, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach

Figure 1 for End-to-end training of time domain audio separation and recognition
Figure 2 for End-to-end training of time domain audio separation and recognition
Figure 3 for End-to-end training of time domain audio separation and recognition
Figure 4 for End-to-end training of time domain audio separation and recognition
Viaarxiv icon

Ene-to-end training of time domain audio separation and recognition

Add code
Bookmark button
Alert button
Dec 18, 2019
Thilo von Neumann, Keisuke Kinoshita, Lukas Drude, Christoph Boeddeker, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach

Figure 1 for Ene-to-end training of time domain audio separation and recognition
Figure 2 for Ene-to-end training of time domain audio separation and recognition
Figure 3 for Ene-to-end training of time domain audio separation and recognition
Figure 4 for Ene-to-end training of time domain audio separation and recognition
Viaarxiv icon

Demystifying TasNet: A Dissecting Approach

Add code
Bookmark button
Alert button
Nov 20, 2019
Jens Heitkaemper, Darius Jakobeit, Christoph Boeddeker, Lukas Drude, Reinhold Haeb-Umbach

Figure 1 for Demystifying TasNet: A Dissecting Approach
Figure 2 for Demystifying TasNet: A Dissecting Approach
Figure 3 for Demystifying TasNet: A Dissecting Approach
Figure 4 for Demystifying TasNet: A Dissecting Approach
Viaarxiv icon

SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition

Add code
Bookmark button
Alert button
Oct 30, 2019
Lukas Drude, Jens Heitkaemper, Christoph Boeddeker, Reinhold Haeb-Umbach

Figure 1 for SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition
Figure 2 for SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition
Figure 3 for SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition
Figure 4 for SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition
Viaarxiv icon

Unsupervised training of neural mask-based beamforming

Add code
Bookmark button
Alert button
Apr 08, 2019
Lukas Drude, Jahn Heymann, Reinhold Haeb-Umbach

Figure 1 for Unsupervised training of neural mask-based beamforming
Figure 2 for Unsupervised training of neural mask-based beamforming
Figure 3 for Unsupervised training of neural mask-based beamforming
Figure 4 for Unsupervised training of neural mask-based beamforming
Viaarxiv icon