Zoltan Tuske

Alternating Weak Triphone/BPE Alignment Supervision from Hybrid Model Improves End-to-End ASR

Feb 23, 2024
Jintao Jiang, Yingbo Gao, Mohammad Zeineldeen, Zoltan Tuske

Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR

Nov 30, 2023
Jintao Jiang, Yingbo Gao, Zoltan Tuske

Improving End-to-End Models for Set Prediction in Spoken Language Understanding

Jan 28, 2022
Hong-Kwang J. Kuo, Zoltan Tuske, Samuel Thomas, Brian Kingsbury, George Saon

Reducing Exposure Bias in Training Recurrent Neural Network Transducers

Aug 24, 2021
Xiaodong Cui, Brian Kingsbury, George Saon, David Haws, Zoltan Tuske

End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features

Nov 16, 2020
Edmilson Morais, Hong-Kwang J. Kuo, Samuel Thomas, Zoltan Tuske, Brian Kingsbury

English Broadcast News Speech Recognition by Humans and Machines

Apr 30, 2019
Samuel Thomas, Masayuki Suzuki, Yinghui Huang, Gakuto Kurata, Zoltan Tuske, George Saon, Brian Kingsbury, Michael Picheny, Tom Dibert, Alice Kaiser-Schatzlein, Bern Samko
