Picture for Suyoun Kim

Suyoun Kim

Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding

Add code
Apr 05, 2021
Figure 1 for Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding
Figure 2 for Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding
Figure 3 for Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding
Figure 4 for Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding
Viaarxiv icon

Improving RNN Transducer Based ASR with Auxiliary Tasks

Add code
Nov 09, 2020
Figure 1 for Improving RNN Transducer Based ASR with Auxiliary Tasks
Figure 2 for Improving RNN Transducer Based ASR with Auxiliary Tasks
Figure 3 for Improving RNN Transducer Based ASR with Auxiliary Tasks
Figure 4 for Improving RNN Transducer Based ASR with Auxiliary Tasks
Viaarxiv icon

Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer

Add code
Oct 26, 2020
Figure 1 for Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer
Figure 2 for Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer
Figure 3 for Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer
Figure 4 for Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer
Viaarxiv icon

Cross-Attention End-to-End ASR for Two-Party Conversations

Add code
Jul 24, 2019
Figure 1 for Cross-Attention End-to-End ASR for Two-Party Conversations
Figure 2 for Cross-Attention End-to-End ASR for Two-Party Conversations
Figure 3 for Cross-Attention End-to-End ASR for Two-Party Conversations
Viaarxiv icon

Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion

Add code
Jun 27, 2019
Figure 1 for Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion
Figure 2 for Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion
Figure 3 for Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion
Figure 4 for Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion
Viaarxiv icon

Acoustic-to-Word Models with Conversational Context Information

Add code
May 21, 2019
Figure 1 for Acoustic-to-Word Models with Conversational Context Information
Figure 2 for Acoustic-to-Word Models with Conversational Context Information
Figure 3 for Acoustic-to-Word Models with Conversational Context Information
Figure 4 for Acoustic-to-Word Models with Conversational Context Information
Viaarxiv icon

Improved training for online end-to-end speech recognition systems

Add code
Aug 30, 2018
Figure 1 for Improved training for online end-to-end speech recognition systems
Figure 2 for Improved training for online end-to-end speech recognition systems
Figure 3 for Improved training for online end-to-end speech recognition systems
Viaarxiv icon

Dialog-context aware end-to-end speech recognition

Add code
Aug 07, 2018
Figure 1 for Dialog-context aware end-to-end speech recognition
Figure 2 for Dialog-context aware end-to-end speech recognition
Figure 3 for Dialog-context aware end-to-end speech recognition
Figure 4 for Dialog-context aware end-to-end speech recognition
Viaarxiv icon

Towards Language-Universal End-to-End Speech Recognition

Add code
Nov 06, 2017
Figure 1 for Towards Language-Universal End-to-End Speech Recognition
Figure 2 for Towards Language-Universal End-to-End Speech Recognition
Figure 3 for Towards Language-Universal End-to-End Speech Recognition
Figure 4 for Towards Language-Universal End-to-End Speech Recognition
Viaarxiv icon

Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning

Add code
Jan 31, 2017
Figure 1 for Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning
Figure 2 for Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning
Figure 3 for Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning
Figure 4 for Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning
Viaarxiv icon