Picture for Hong-Kwang J. Kuo

Hong-Kwang J. Kuo

Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems

Add code
Apr 11, 2022
Figure 1 for Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems
Figure 2 for Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems
Figure 3 for Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems
Viaarxiv icon

Towards End-to-End Integration of Dialog History for Improved Spoken Language Understanding

Add code
Apr 11, 2022
Figure 1 for Towards End-to-End Integration of Dialog History for Improved Spoken Language Understanding
Figure 2 for Towards End-to-End Integration of Dialog History for Improved Spoken Language Understanding
Figure 3 for Towards End-to-End Integration of Dialog History for Improved Spoken Language Understanding
Figure 4 for Towards End-to-End Integration of Dialog History for Improved Spoken Language Understanding
Viaarxiv icon

Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding Systems

Add code
Feb 26, 2022
Figure 1 for Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding Systems
Figure 2 for Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding Systems
Figure 3 for Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding Systems
Figure 4 for Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding Systems
Viaarxiv icon

Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models

Add code
Feb 26, 2022
Figure 1 for Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
Figure 2 for Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
Figure 3 for Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
Figure 4 for Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
Viaarxiv icon

Improving End-to-End Models for Set Prediction in Spoken Language Understanding

Add code
Jan 28, 2022
Figure 1 for Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Figure 2 for Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Figure 3 for Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Figure 4 for Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Viaarxiv icon

Integrating Dialog History into End-to-End Spoken Language Understanding Systems

Add code
Aug 18, 2021
Figure 1 for Integrating Dialog History into End-to-End Spoken Language Understanding Systems
Figure 2 for Integrating Dialog History into End-to-End Spoken Language Understanding Systems
Figure 3 for Integrating Dialog History into End-to-End Spoken Language Understanding Systems
Figure 4 for Integrating Dialog History into End-to-End Spoken Language Understanding Systems
Viaarxiv icon

RNN Transducer Models For Spoken Language Understanding

Add code
Apr 08, 2021
Figure 1 for RNN Transducer Models For Spoken Language Understanding
Figure 2 for RNN Transducer Models For Spoken Language Understanding
Figure 3 for RNN Transducer Models For Spoken Language Understanding
Figure 4 for RNN Transducer Models For Spoken Language Understanding
Viaarxiv icon

End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features

Add code
Nov 16, 2020
Figure 1 for End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Figure 2 for End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Figure 3 for End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Figure 4 for End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Viaarxiv icon

End-to-End Spoken Language Understanding Without Full Transcripts

Add code
Sep 30, 2020
Figure 1 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 2 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 3 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 4 for End-to-End Spoken Language Understanding Without Full Transcripts
Viaarxiv icon

The IBM 2016 English Conversational Telephone Speech Recognition System

Add code
Jun 22, 2016
Figure 1 for The IBM 2016 English Conversational Telephone Speech Recognition System
Figure 2 for The IBM 2016 English Conversational Telephone Speech Recognition System
Figure 3 for The IBM 2016 English Conversational Telephone Speech Recognition System
Figure 4 for The IBM 2016 English Conversational Telephone Speech Recognition System
Viaarxiv icon