Yatharth Saraf

Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion

Apr 05, 2021

A Multi-View Approach To Audio-Visual Speaker Verification

Feb 11, 2021

Improving RNN Transducer Based ASR with Auxiliary Tasks

Nov 09, 2020

Contextual RNN-T For Open Domain ASR

Jun 04, 2020

Fast, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces

May 19, 2020

Large scale weakly and semi-supervised learning for low-resource video ASR

May 16, 2020

Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model

May 15, 2020

Training ASR models by Generation of Contextual Information

Oct 27, 2019

Multilingual ASR with Massive Data Augmentation

Sep 14, 2019