Alex Waibel

Karlsruhe Institute of Technology

Super-Human Performance in Online Low-latency Recognition of Conversational Speech

Oct 22, 2020
Via arXiv

Error-correction and extraction in request dialogs

Apr 08, 2020
Via arXiv

High Performance Sequence-to-Sequence Model for Streaming Speech Recognition

Mar 22, 2020
Via arXiv

Low Latency ASR for Simultaneous Speech Translation

Mar 22, 2020
Via arXiv

Toward Cross-Domain Speech Recognition with End-to-End Models

Mar 09, 2020
Via arXiv

An Interactive Indoor Drone Assistant

Dec 09, 2019
Via arXiv

Bimodal Speech Emotion Recognition Using Pre-Trained Language Models

Nov 29, 2019
Via arXiv

Low-Resource Machine Translation using Interlinear Glosses

Nov 07, 2019
Via arXiv

Improving sequence-to-sequence speech recognition training with on-the-fly data augmentation

Oct 29, 2019
Via arXiv

Incremental processing of noisy user utterances in the spoken language understanding task

Sep 30, 2019
Via arXiv