Picture for Brian Kingsbury

Brian Kingsbury

Representation based meta-learning for few-shot spoken intent recognition

Add code
Jun 29, 2021
Figure 1 for Representation based meta-learning for few-shot spoken intent recognition
Figure 2 for Representation based meta-learning for few-shot spoken intent recognition
Figure 3 for Representation based meta-learning for few-shot spoken intent recognition
Figure 4 for Representation based meta-learning for few-shot spoken intent recognition
Viaarxiv icon

Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos

Add code
May 05, 2021
Figure 1 for Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos
Figure 2 for Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos
Figure 3 for Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos
Figure 4 for Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos
Viaarxiv icon

On the limit of English conversational speech recognition

Add code
May 03, 2021
Figure 1 for On the limit of English conversational speech recognition
Figure 2 for On the limit of English conversational speech recognition
Figure 3 for On the limit of English conversational speech recognition
Viaarxiv icon

RNN Transducer Models For Spoken Language Understanding

Add code
Apr 08, 2021
Figure 1 for RNN Transducer Models For Spoken Language Understanding
Figure 2 for RNN Transducer Models For Spoken Language Understanding
Figure 3 for RNN Transducer Models For Spoken Language Understanding
Figure 4 for RNN Transducer Models For Spoken Language Understanding
Viaarxiv icon

Advancing RNN Transducer Technology for Speech Recognition

Add code
Mar 17, 2021
Figure 1 for Advancing RNN Transducer Technology for Speech Recognition
Figure 2 for Advancing RNN Transducer Technology for Speech Recognition
Figure 3 for Advancing RNN Transducer Technology for Speech Recognition
Figure 4 for Advancing RNN Transducer Technology for Speech Recognition
Viaarxiv icon

Federated Acoustic Modeling For Automatic Speech Recognition

Add code
Feb 08, 2021
Figure 1 for Federated Acoustic Modeling For Automatic Speech Recognition
Figure 2 for Federated Acoustic Modeling For Automatic Speech Recognition
Figure 3 for Federated Acoustic Modeling For Automatic Speech Recognition
Figure 4 for Federated Acoustic Modeling For Automatic Speech Recognition
Viaarxiv icon

End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features

Add code
Nov 16, 2020
Figure 1 for End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Figure 2 for End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Figure 3 for End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Figure 4 for End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Viaarxiv icon

Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems

Add code
Oct 08, 2020
Figure 1 for Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Figure 2 for Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Figure 3 for Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Figure 4 for Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Viaarxiv icon

End-to-End Spoken Language Understanding Without Full Transcripts

Add code
Sep 30, 2020
Figure 1 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 2 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 3 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 4 for End-to-End Spoken Language Understanding Without Full Transcripts
Viaarxiv icon

AVLnet: Learning Audio-Visual Language Representations from Instructional Videos

Add code
Jun 16, 2020
Figure 1 for AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
Figure 2 for AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
Figure 3 for AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
Figure 4 for AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
Viaarxiv icon