Alert button
Picture for Hong-Kwang J. Kuo

Hong-Kwang J. Kuo

Alert button

Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems

Add code
Bookmark button
Alert button
Apr 11, 2022
Vishal Sunder, Eric Fosler-Lussier, Samuel Thomas, Hong-Kwang J. Kuo, Brian Kingsbury

Figure 1 for Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems
Figure 2 for Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems
Figure 3 for Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems
Viaarxiv icon

Towards End-to-End Integration of Dialog History for Improved Spoken Language Understanding

Add code
Bookmark button
Alert button
Apr 11, 2022
Vishal Sunder, Samuel Thomas, Hong-Kwang J. Kuo, Jatin Ganhotra, Brian Kingsbury, Eric Fosler-Lussier

Figure 1 for Towards End-to-End Integration of Dialog History for Improved Spoken Language Understanding
Figure 2 for Towards End-to-End Integration of Dialog History for Improved Spoken Language Understanding
Figure 3 for Towards End-to-End Integration of Dialog History for Improved Spoken Language Understanding
Figure 4 for Towards End-to-End Integration of Dialog History for Improved Spoken Language Understanding
Viaarxiv icon

Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding Systems

Add code
Bookmark button
Alert button
Feb 26, 2022
Samuel Thomas, Hong-Kwang J. Kuo, Brian Kingsbury, George Saon

Figure 1 for Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding Systems
Figure 2 for Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding Systems
Figure 3 for Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding Systems
Figure 4 for Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding Systems
Viaarxiv icon

Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models

Add code
Bookmark button
Alert button
Feb 26, 2022
Samuel Thomas, Brian Kingsbury, George Saon, Hong-Kwang J. Kuo

Figure 1 for Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
Figure 2 for Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
Figure 3 for Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
Figure 4 for Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
Viaarxiv icon

Improving End-to-End Models for Set Prediction in Spoken Language Understanding

Add code
Bookmark button
Alert button
Jan 28, 2022
Hong-Kwang J. Kuo, Zoltan Tuske, Samuel Thomas, Brian Kingsbury, George Saon

Figure 1 for Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Figure 2 for Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Figure 3 for Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Figure 4 for Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Viaarxiv icon

Integrating Dialog History into End-to-End Spoken Language Understanding Systems

Add code
Bookmark button
Alert button
Aug 18, 2021
Jatin Ganhotra, Samuel Thomas, Hong-Kwang J. Kuo, Sachindra Joshi, George Saon, Zoltán Tüske, Brian Kingsbury

Figure 1 for Integrating Dialog History into End-to-End Spoken Language Understanding Systems
Figure 2 for Integrating Dialog History into End-to-End Spoken Language Understanding Systems
Figure 3 for Integrating Dialog History into End-to-End Spoken Language Understanding Systems
Figure 4 for Integrating Dialog History into End-to-End Spoken Language Understanding Systems
Viaarxiv icon

RNN Transducer Models For Spoken Language Understanding

Add code
Bookmark button
Alert button
Apr 08, 2021
Samuel Thomas, Hong-Kwang J. Kuo, George Saon, Zoltán Tüske, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory

Figure 1 for RNN Transducer Models For Spoken Language Understanding
Figure 2 for RNN Transducer Models For Spoken Language Understanding
Figure 3 for RNN Transducer Models For Spoken Language Understanding
Figure 4 for RNN Transducer Models For Spoken Language Understanding
Viaarxiv icon

End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features

Add code
Bookmark button
Alert button
Nov 16, 2020
Edmilson Morais, Hong-Kwang J. Kuo, Samuel Thomas, Zoltan Tuske, Brian Kingsbury

Figure 1 for End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Figure 2 for End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Figure 3 for End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Figure 4 for End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Viaarxiv icon

End-to-End Spoken Language Understanding Without Full Transcripts

Add code
Bookmark button
Alert button
Sep 30, 2020
Hong-Kwang J. Kuo, Zoltán Tüske, Samuel Thomas, Yinghui Huang, Kartik Audhkhasi, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory, Luis Lastras

Figure 1 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 2 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 3 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 4 for End-to-End Spoken Language Understanding Without Full Transcripts
Viaarxiv icon

The IBM 2016 English Conversational Telephone Speech Recognition System

Add code
Bookmark button
Alert button
Jun 22, 2016
George Saon, Tom Sercu, Steven Rennie, Hong-Kwang J. Kuo

Figure 1 for The IBM 2016 English Conversational Telephone Speech Recognition System
Figure 2 for The IBM 2016 English Conversational Telephone Speech Recognition System
Figure 3 for The IBM 2016 English Conversational Telephone Speech Recognition System
Figure 4 for The IBM 2016 English Conversational Telephone Speech Recognition System
Viaarxiv icon