Ilya Sklyar

Anatomy of Industrial Scale Multilingual ASR

Apr 16, 2024
Via arXiv

Two-pass Endpoint Detection for Speech Recognition

Jan 17, 2024
Via arXiv

Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech

May 10, 2022
Via arXiv

Multi-turn RNN-T for streaming recognition of multi-party speech

Dec 19, 2021
Via arXiv

Streaming Multi-speaker ASR with RNN-T

Nov 23, 2020
Via arXiv

Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech

May 09, 2019
Via arXiv