Alert button
Picture for Boris Ginsburg

Boris Ginsburg

Alert button

AmberNet: A Compact End-to-End Model for Spoken Language Identification

Add code
Bookmark button
Alert button
Oct 27, 2022
Fei Jia, Nithin Rao Koluguri, Jagadeesh Balam, Boris Ginsburg

Figure 1 for AmberNet: A Compact End-to-End Model for Spoken Language Identification
Figure 2 for AmberNet: A Compact End-to-End Model for Spoken Language Identification
Figure 3 for AmberNet: A Compact End-to-End Model for Spoken Language Identification
Figure 4 for AmberNet: A Compact End-to-End Model for Spoken Language Identification
Viaarxiv icon

Damage Control During Domain Adaptation for Transducer Based Automatic Speech Recognition

Add code
Bookmark button
Alert button
Oct 06, 2022
Somshubra Majumdar, Shantanu Acharya, Vitaly Lavrukhin, Boris Ginsburg

Figure 1 for Damage Control During Domain Adaptation for Transducer Based Automatic Speech Recognition
Figure 2 for Damage Control During Domain Adaptation for Transducer Based Automatic Speech Recognition
Figure 3 for Damage Control During Domain Adaptation for Transducer Based Automatic Speech Recognition
Viaarxiv icon

Thutmose Tagger: Single-pass neural model for Inverse Text Normalization

Add code
Bookmark button
Alert button
Jul 29, 2022
Alexandra Antonova, Evelina Bakhturina, Boris Ginsburg

Figure 1 for Thutmose Tagger: Single-pass neural model for Inverse Text Normalization
Figure 2 for Thutmose Tagger: Single-pass neural model for Inverse Text Normalization
Figure 3 for Thutmose Tagger: Single-pass neural model for Inverse Text Normalization
Figure 4 for Thutmose Tagger: Single-pass neural model for Inverse Text Normalization
Viaarxiv icon

BigVGAN: A Universal Neural Vocoder with Large-Scale Training

Add code
Bookmark button
Alert button
Jun 09, 2022
Sang-gil Lee, Wei Ping, Boris Ginsburg, Bryan Catanzaro, Sungroh Yoon

Figure 1 for BigVGAN: A Universal Neural Vocoder with Large-Scale Training
Figure 2 for BigVGAN: A Universal Neural Vocoder with Large-Scale Training
Figure 3 for BigVGAN: A Universal Neural Vocoder with Large-Scale Training
Figure 4 for BigVGAN: A Universal Neural Vocoder with Large-Scale Training
Viaarxiv icon

Multi-scale Speaker Diarization with Dynamic Scale Weighting

Add code
Bookmark button
Alert button
Mar 30, 2022
Tae Jin Park, Nithin Rao Koluguri, Jagadeesh Balam, Boris Ginsburg

Figure 1 for Multi-scale Speaker Diarization with Dynamic Scale Weighting
Figure 2 for Multi-scale Speaker Diarization with Dynamic Scale Weighting
Figure 3 for Multi-scale Speaker Diarization with Dynamic Scale Weighting
Figure 4 for Multi-scale Speaker Diarization with Dynamic Scale Weighting
Viaarxiv icon

Shallow Fusion of Weighted Finite-State Transducer and Language Model for Text Normalization

Add code
Bookmark button
Alert button
Mar 29, 2022
Evelina Bakhturina, Yang Zhang, Boris Ginsburg

Figure 1 for Shallow Fusion of Weighted Finite-State Transducer and Language Model for Text Normalization
Figure 2 for Shallow Fusion of Weighted Finite-State Transducer and Language Model for Text Normalization
Figure 3 for Shallow Fusion of Weighted Finite-State Transducer and Language Model for Text Normalization
Figure 4 for Shallow Fusion of Weighted Finite-State Transducer and Language Model for Text Normalization
Viaarxiv icon

Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings

Add code
Bookmark button
Alert button
Oct 22, 2021
Oktai Tatanov, Stanislav Beliaev, Boris Ginsburg

Figure 1 for Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings
Figure 2 for Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings
Figure 3 for Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings
Figure 4 for Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings
Viaarxiv icon

Adapting TTS models For New Speakers using Transfer Learning

Add code
Bookmark button
Alert button
Oct 12, 2021
Paarth Neekhara, Jason Li, Boris Ginsburg

Figure 1 for Adapting TTS models For New Speakers using Transfer Learning
Figure 2 for Adapting TTS models For New Speakers using Transfer Learning
Viaarxiv icon

TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context

Add code
Bookmark button
Alert button
Oct 08, 2021
Nithin Rao Koluguri, Taejin Park, Boris Ginsburg

Figure 1 for TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context
Figure 2 for TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context
Figure 3 for TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context
Figure 4 for TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context
Viaarxiv icon