Picture for Brian Kingsbury

Brian Kingsbury

Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models

Add code
Feb 26, 2022
Figure 1 for Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
Figure 2 for Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
Figure 3 for Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
Figure 4 for Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
Viaarxiv icon

A new data augmentation method for intent classification enhancement and its application on spoken conversation datasets

Add code
Feb 21, 2022
Figure 1 for A new data augmentation method for intent classification enhancement and its application on spoken conversation datasets
Figure 2 for A new data augmentation method for intent classification enhancement and its application on spoken conversation datasets
Figure 3 for A new data augmentation method for intent classification enhancement and its application on spoken conversation datasets
Viaarxiv icon

Improving End-to-End Models for Set Prediction in Spoken Language Understanding

Add code
Jan 28, 2022
Figure 1 for Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Figure 2 for Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Figure 3 for Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Figure 4 for Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Viaarxiv icon

Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval

Add code
Dec 08, 2021
Figure 1 for Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval
Figure 2 for Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval
Figure 3 for Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval
Figure 4 for Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval
Viaarxiv icon

Loss Landscape Dependent Self-Adjusting Learning Rates in Decentralized Stochastic Gradient Descent

Add code
Dec 02, 2021
Figure 1 for Loss Landscape Dependent Self-Adjusting Learning Rates in Decentralized Stochastic Gradient Descent
Figure 2 for Loss Landscape Dependent Self-Adjusting Learning Rates in Decentralized Stochastic Gradient Descent
Figure 3 for Loss Landscape Dependent Self-Adjusting Learning Rates in Decentralized Stochastic Gradient Descent
Figure 4 for Loss Landscape Dependent Self-Adjusting Learning Rates in Decentralized Stochastic Gradient Descent
Viaarxiv icon

Cascaded Multilingual Audio-Visual Learning from Videos

Add code
Nov 08, 2021
Figure 1 for Cascaded Multilingual Audio-Visual Learning from Videos
Figure 2 for Cascaded Multilingual Audio-Visual Learning from Videos
Figure 3 for Cascaded Multilingual Audio-Visual Learning from Videos
Figure 4 for Cascaded Multilingual Audio-Visual Learning from Videos
Viaarxiv icon

Asynchronous Decentralized Distributed Training of Acoustic Models

Add code
Oct 21, 2021
Figure 1 for Asynchronous Decentralized Distributed Training of Acoustic Models
Figure 2 for Asynchronous Decentralized Distributed Training of Acoustic Models
Figure 3 for Asynchronous Decentralized Distributed Training of Acoustic Models
Figure 4 for Asynchronous Decentralized Distributed Training of Acoustic Models
Viaarxiv icon

4-bit Quantization of LSTM-based Speech Recognition Models

Add code
Aug 27, 2021
Figure 1 for 4-bit Quantization of LSTM-based Speech Recognition Models
Figure 2 for 4-bit Quantization of LSTM-based Speech Recognition Models
Figure 3 for 4-bit Quantization of LSTM-based Speech Recognition Models
Figure 4 for 4-bit Quantization of LSTM-based Speech Recognition Models
Viaarxiv icon

Reducing Exposure Bias in Training Recurrent Neural Network Transducers

Add code
Aug 24, 2021
Figure 1 for Reducing Exposure Bias in Training Recurrent Neural Network Transducers
Figure 2 for Reducing Exposure Bias in Training Recurrent Neural Network Transducers
Figure 3 for Reducing Exposure Bias in Training Recurrent Neural Network Transducers
Figure 4 for Reducing Exposure Bias in Training Recurrent Neural Network Transducers
Viaarxiv icon

Integrating Dialog History into End-to-End Spoken Language Understanding Systems

Add code
Aug 18, 2021
Figure 1 for Integrating Dialog History into End-to-End Spoken Language Understanding Systems
Figure 2 for Integrating Dialog History into End-to-End Spoken Language Understanding Systems
Figure 3 for Integrating Dialog History into End-to-End Spoken Language Understanding Systems
Figure 4 for Integrating Dialog History into End-to-End Spoken Language Understanding Systems
Viaarxiv icon