Michael L. Seltzer

Sid

The Llama 3 Herd of Models

Jul 31, 2024

End-to-End Speech Recognition Contextualization with Large Language Models

Sep 19, 2023

Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding

Jul 22, 2023

Improving Fast-slow Encoder based Transducer with Streaming Deliberation

Dec 15, 2022

Massively Multilingual ASR on 70 Languages: Tokenization, Architecture, and Generalization Capabilities

Nov 10, 2022

Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers

Nov 02, 2022

Deliberation Model for On-Device Spoken Language Understanding

Apr 04, 2022

Neural-FST Class Language Model for End-to-End Speech Recognition

Jan 31, 2022

Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric

Oct 11, 2021

Collaborative Training of Acoustic Encoders for Speech Recognition

Jul 13, 2021