Alert button
Picture for Ilya Sklyar

Ilya Sklyar

Alert button

Anatomy of Industrial Scale Multilingual ASR

Add code
Bookmark button
Alert button
Apr 16, 2024
Francis McCann Ramirez, Luka Chkhetiani, Andrew Ehrenberg, Robert McHardy, Rami Botros, Yash Khare, Andrea Vanzo, Taufiquzzaman Peyash, Gabriel Oexle, Michael Liang, Ilya Sklyar, Enver Fakhan, Ahmed Etefy, Daniel McCrystal, Sam Flamini, Domenic Donato, Takuya Yoshioka

Viaarxiv icon

Two-pass Endpoint Detection for Speech Recognition

Add code
Bookmark button
Alert button
Jan 17, 2024
Anirudh Raju, Aparna Khare, Di He, Ilya Sklyar, Long Chen, Sam Alptekin, Viet Anh Trinh, Zhe Zhang, Colin Vaz, Venkatesh Ravichandran, Roland Maas, Ariya Rastrow

Viaarxiv icon

Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech

Add code
Bookmark button
Alert button
May 10, 2022
Ilya Sklyar, Anna Piunova, Christian Osendorfer

Figure 1 for Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech
Figure 2 for Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech
Figure 3 for Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech
Figure 4 for Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech
Viaarxiv icon

Multi-turn RNN-T for streaming recognition of multi-party speech

Add code
Bookmark button
Alert button
Dec 19, 2021
Ilya Sklyar, Anna Piunova, Xianrui Zheng, Yulan Liu

Figure 1 for Multi-turn RNN-T for streaming recognition of multi-party speech
Figure 2 for Multi-turn RNN-T for streaming recognition of multi-party speech
Figure 3 for Multi-turn RNN-T for streaming recognition of multi-party speech
Figure 4 for Multi-turn RNN-T for streaming recognition of multi-party speech
Viaarxiv icon

Streaming Multi-speaker ASR with RNN-T

Add code
Bookmark button
Alert button
Nov 23, 2020
Ilya Sklyar, Anna Piunova, Yulan Liu

Figure 1 for Streaming Multi-speaker ASR with RNN-T
Figure 2 for Streaming Multi-speaker ASR with RNN-T
Figure 3 for Streaming Multi-speaker ASR with RNN-T
Viaarxiv icon

Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech

Add code
Bookmark button
Alert button
May 09, 2019
Tobias Menne, Ilya Sklyar, Ralf Schlüter, Hermann Ney

Figure 1 for Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
Figure 2 for Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
Figure 3 for Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
Figure 4 for Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
Viaarxiv icon