Catalin Zorila

Self-regularised Minimum Latency Training for Streaming Transformer-based Speech Recognition

Apr 24, 2023
Mohan Li, Rama Doddipatla, Catalin Zorila

Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech Recognition

May 09, 2022
Catalin Zorila, Rama Doddipatla

On monoaural speech enhancement for automatic recognition of real noisy speech using mixture invariant training

May 03, 2022
Jisi Zhang, Catalin Zorila, Rama Doddipatla, Jon Barker

Transformer-based Streaming ASR with Cumulative Attention

Mar 11, 2022
Mohan Li, Shucong Zhang, Catalin Zorila, Rama Doddipatla

Monaural source separation: From anechoic to reverberant environments

Nov 15, 2021
Tobias Cord-Landwehr, Christoph Boeddeker, Thilo von Neumann, Catalin Zorila, Rama Doddipatla, Reinhold Haeb-Umbach

Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation

Jun 16, 2021
Jisi Zhang, Catalin Zorila, Rama Doddipatla, Jon Barker

Head-synchronous Decoding for Transformer-based Streaming ASR

Apr 26, 2021
Mohan Li, Catalin Zorila, Rama Doddipatla

Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism

Feb 07, 2021
Jisi Zhang, Catalin Zorila, Rama Doddipatla, Jon Barker

On End-to-end Multi-channel Time Domain Speech Separation in Reverberant Environments

Nov 11, 2020
Jisi Zhang, Catalin Zorila, Rama Doddipatla, Jon Barker

An Investigation into the Effectiveness of Enhancement in ASR Training and Test for CHiME-5 Dinner Party Transcription

Sep 26, 2019
Catalin Zorila, Christoph Boeddeker, Rama Doddipatla, Reinhold Haeb-Umbach
