Marc Delcroix

Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization

May 23, 2023
Marc Delcroix, Naohiro Tawara, Mireia Diez, Federico Landini, Anna Silnova, Atsunori Ogawa, Tomohiro Nakatani, Lukas Burget, Shoko Araki

Leveraging Large Text Corpora for End-to-End Speech Summarization

Mar 02, 2023
Kohei Matsuura, Takanori Ashihara, Takafumi Moriya, Tomohiro Tanaka, Atsunori Ogawa, Marc Delcroix, Ryo Masumura

Neural Target Speech Extraction: An Overview

Jan 31, 2023
Katerina Zmolikova, Marc Delcroix, Tsubasa Ochiai, Keisuke Kinoshita, Jan Černocký, Dong Yu

On Word Error Rate Definitions and their Efficient Computation for Multi-Speaker Speech Recognition Systems

Nov 29, 2022
Thilo von Neumann, Christoph Boeddeker, Keisuke Kinoshita, Marc Delcroix, Reinhold Haeb-Umbach

Streaming Target-Speaker ASR with Neural Transducer

Sep 19, 2022
Takafumi Moriya, Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Takahiro Shinozaki

Streaming End-to-End Target Speaker ASR

Sep 09, 2022
Takafumi Moriya, Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Takahiro Shinozaki

Analysis of impact of emotions on target speech extraction and speech separation

Aug 15, 2022
Ján Švec, Kateřina Žmolíková, Martin Kocour, Marc Delcroix, Tsubasa Ochiai, Ladislav Mošner, Jan Černocký

Utterance-by-utterance overlap-aware neural diarization with Graph-PIT

Jul 28, 2022
Keisuke Kinoshita, Thilo von Neumann, Marc Delcroix, Christoph Boeddeker, Reinhold Haeb-Umbach

ConceptBeam: Concept Driven Target Speech Extraction

Jul 25, 2022
Yasunori Ohishi, Marc Delcroix, Tsubasa Ochiai, Shoko Araki, Daiki Takeuchi, Daisuke Niizumi, Akisato Kimura, Noboru Harada, Kunio Kashino

Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations

Jun 16, 2022
Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Takafumi Moriya, Naoki Makishima, Mana Ihori, Tomohiro Tanaka, Ryo Masumura
