Alert button
Picture for Brian Kingsbury

Brian Kingsbury

Alert button

Federated Acoustic Modeling For Automatic Speech Recognition

Add code
Bookmark button
Alert button
Feb 08, 2021
Xiaodong Cui, Songtao Lu, Brian Kingsbury

Figure 1 for Federated Acoustic Modeling For Automatic Speech Recognition
Figure 2 for Federated Acoustic Modeling For Automatic Speech Recognition
Figure 3 for Federated Acoustic Modeling For Automatic Speech Recognition
Figure 4 for Federated Acoustic Modeling For Automatic Speech Recognition
Viaarxiv icon

End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features

Add code
Bookmark button
Alert button
Nov 16, 2020
Edmilson Morais, Hong-Kwang J. Kuo, Samuel Thomas, Zoltan Tuske, Brian Kingsbury

Figure 1 for End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Figure 2 for End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Figure 3 for End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Figure 4 for End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Viaarxiv icon

Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems

Add code
Bookmark button
Alert button
Oct 08, 2020
Yinghui Huang, Hong-Kwang Kuo, Samuel Thomas, Zvi Kons, Kartik Audhkhasi, Brian Kingsbury, Ron Hoory, Michael Picheny

Figure 1 for Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Figure 2 for Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Figure 3 for Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Figure 4 for Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Viaarxiv icon

End-to-End Spoken Language Understanding Without Full Transcripts

Add code
Bookmark button
Alert button
Sep 30, 2020
Hong-Kwang J. Kuo, Zoltán Tüske, Samuel Thomas, Yinghui Huang, Kartik Audhkhasi, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory, Luis Lastras

Figure 1 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 2 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 3 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 4 for End-to-End Spoken Language Understanding Without Full Transcripts
Viaarxiv icon

AVLnet: Learning Audio-Visual Language Representations from Instructional Videos

Add code
Bookmark button
Alert button
Jun 16, 2020
Andrew Rouditchenko, Angie Boggust, David Harwath, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Rogerio Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James Glass

Figure 1 for AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
Figure 2 for AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
Figure 3 for AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
Figure 4 for AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
Viaarxiv icon

Improving Efficiency in Large-Scale Decentralized Distributed Training

Add code
Bookmark button
Alert button
Feb 04, 2020
Wei Zhang, Xiaodong Cui, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, Youssef Mroueh, Alper Buyuktosunoglu, Payel Das, David Kung, Michael Picheny

Figure 1 for Improving Efficiency in Large-Scale Decentralized Distributed Training
Figure 2 for Improving Efficiency in Large-Scale Decentralized Distributed Training
Figure 3 for Improving Efficiency in Large-Scale Decentralized Distributed Training
Figure 4 for Improving Efficiency in Large-Scale Decentralized Distributed Training
Viaarxiv icon

Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard-300

Add code
Bookmark button
Alert button
Jan 20, 2020
Zoltán Tüske, George Saon, Kartik Audhkhasi, Brian Kingsbury

Figure 1 for Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard-300
Figure 2 for Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard-300
Figure 3 for Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard-300
Figure 4 for Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard-300
Viaarxiv icon

Challenging the Boundaries of Speech Recognition: The MALACH Corpus

Add code
Bookmark button
Alert button
Aug 09, 2019
Michael Picheny, Zóltan Tüske, Brian Kingsbury, Kartik Audhkhasi, Xiaodong Cui, George Saon

Figure 1 for Challenging the Boundaries of Speech Recognition: The MALACH Corpus
Figure 2 for Challenging the Boundaries of Speech Recognition: The MALACH Corpus
Viaarxiv icon

A Highly Efficient Distributed Deep Learning System For Automatic Speech Recognition

Add code
Bookmark button
Alert button
Jul 10, 2019
Wei Zhang, Xiaodong Cui, Ulrich Finkler, George Saon, Abdullah Kayi, Alper Buyuktosunoglu, Brian Kingsbury, David Kung, Michael Picheny

Figure 1 for A Highly Efficient Distributed Deep Learning System For Automatic Speech Recognition
Figure 2 for A Highly Efficient Distributed Deep Learning System For Automatic Speech Recognition
Figure 3 for A Highly Efficient Distributed Deep Learning System For Automatic Speech Recognition
Figure 4 for A Highly Efficient Distributed Deep Learning System For Automatic Speech Recognition
Viaarxiv icon

English Broadcast News Speech Recognition by Humans and Machines

Add code
Bookmark button
Alert button
Apr 30, 2019
Samuel Thomas, Masayuki Suzuki, Yinghui Huang, Gakuto Kurata, Zoltan Tuske, George Saon, Brian Kingsbury, Michael Picheny, Tom Dibert, Alice Kaiser-Schatzlein, Bern Samko

Figure 1 for English Broadcast News Speech Recognition by Humans and Machines
Figure 2 for English Broadcast News Speech Recognition by Humans and Machines
Figure 3 for English Broadcast News Speech Recognition by Humans and Machines
Figure 4 for English Broadcast News Speech Recognition by Humans and Machines
Viaarxiv icon