Gakuto Kurata

Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems

Sep 07, 2023
Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Masayasu Muraoka, George Saon

Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems

Apr 01, 2022
Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Nobuyasu Itoh, George Saon

Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing

Mar 29, 2022
Xiaodong Cui, George Saon, Tohru Nagano, Masayuki Suzuki, Takashi Fukuda, Brian Kingsbury, Gakuto Kurata

Knowledge Distillation Leveraging Alternative Soft Targets from Non-Parallel Qualified Speech Data

Dec 16, 2021
Tohru Nagano, Takashi Fukuda, Gakuto Kurata

RNN Transducer Models For Spoken Language Understanding

Apr 08, 2021
Samuel Thomas, Hong-Kwang J. Kuo, George Saon, Zoltán Tüske, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory

End-to-End Spoken Language Understanding Without Full Transcripts

Sep 30, 2020
Hong-Kwang J. Kuo, Zoltán Tüske, Samuel Thomas, Yinghui Huang, Kartik Audhkhasi, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory, Luis Lastras

English Broadcast News Speech Recognition by Humans and Machines

Apr 30, 2019
Samuel Thomas, Masayuki Suzuki, Yinghui Huang, Gakuto Kurata, Zoltan Tuske, George Saon, Brian Kingsbury, Michael Picheny, Tom Dibert, Alice Kaiser-Schatzlein, Bern Samko

Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation

Apr 17, 2019
Gakuto Kurata, Kartik Audhkhasi

Language Modeling with Highway LSTM

Sep 19, 2017
Gakuto Kurata, Bhuvana Ramabhadran, George Saon, Abhinav Sethy

English Conversational Telephone Speech Recognition by Humans and Machines

Mar 06, 2017
George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall
