Michael Picheny

Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible Inputs

Apr 07, 2021
Sujeong Cha, Wangrui Hou, Hyun Jung, My Phung, Michael Picheny, Hong-Kwang Kuo, Samuel Thomas, Edmilson Morais

Diarization of Legal Proceedings. Identifying and Transcribing Judicial Speech from Recorded Court Audio

Apr 03, 2021
Jeffrey Tumminia, Amanda Kuznecov, Sophia Tsilerides, Ilana Weinstein, Brian McFee, Michael Picheny, Aaron R. Kaufman

Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems

Oct 08, 2020
Yinghui Huang, Hong-Kwang Kuo, Samuel Thomas, Zvi Kons, Kartik Audhkhasi, Brian Kingsbury, Ron Hoory, Michael Picheny

AVLnet: Learning Audio-Visual Language Representations from Instructional Videos

Jun 16, 2020
Andrew Rouditchenko, Angie Boggust, David Harwath, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Rogerio Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James Glass

Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition

Feb 24, 2020
Xiaodong Cui, Wei Zhang, Ulrich Finkler, George Saon, Michael Picheny, David Kung

Improving Efficiency in Large-Scale Decentralized Distributed Training

Feb 04, 2020
Wei Zhang, Xiaodong Cui, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, Youssef Mroueh, Alper Buyuktosunoglu, Payel Das, David Kung, Michael Picheny

Challenging the Boundaries of Speech Recognition: The MALACH Corpus

Aug 09, 2019
Michael Picheny, Zoltán Tüske, Brian Kingsbury, Kartik Audhkhasi, Xiaodong Cui, George Saon

Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition

Jul 10, 2019
Khoi-Nguyen C. Mac, Xiaodong Cui, Wei Zhang, Michael Picheny

Acoustic Model Optimization Based On Evolutionary Stochastic Gradient Descent with Anchors for Automatic Speech Recognition

Jul 10, 2019
Xiaodong Cui, Michael Picheny

A Highly Efficient Distributed Deep Learning System For Automatic Speech Recognition

Jul 10, 2019
Wei Zhang, Xiaodong Cui, Ulrich Finkler, George Saon, Abdullah Kayi, Alper Buyuktosunoglu, Brian Kingsbury, David Kung, Michael Picheny
