Alert button
Picture for Tsubasa Ochiai

Tsubasa Ochiai

Alert button

Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance

Add code
Bookmark button
Alert button
Apr 23, 2024
Tsubasa Ochiai, Kazuma Iwamoto, Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki, Shigeru Katagiri

Viaarxiv icon

Probing Self-supervised Learning Models with Target Speech Extraction

Add code
Bookmark button
Alert button
Feb 17, 2024
Junyi Peng, Marc Delcroix, Tsubasa Ochiai, Oldrich Plchot, Takanori Ashihara, Shoko Araki, Jan Cernocky

Viaarxiv icon

Target Speech Extraction with Pre-trained Self-supervised Learning Models

Add code
Bookmark button
Alert button
Feb 17, 2024
Junyi Peng, Marc Delcroix, Tsubasa Ochiai, Oldrich Plchot, Shoko Araki, Jan Cernocky

Figure 1 for Target Speech Extraction with Pre-trained Self-supervised Learning Models
Figure 2 for Target Speech Extraction with Pre-trained Self-supervised Learning Models
Figure 3 for Target Speech Extraction with Pre-trained Self-supervised Learning Models
Figure 4 for Target Speech Extraction with Pre-trained Self-supervised Learning Models
Viaarxiv icon

Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers

Add code
Bookmark button
Alert button
Feb 05, 2024
Marvin Tammen, Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Shoko Araki, Simon Doclo

Figure 1 for Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers
Figure 2 for Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers
Figure 3 for Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers
Figure 4 for Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers
Viaarxiv icon

How does end-to-end speech recognition training impact speech enhancement artifacts?

Add code
Bookmark button
Alert button
Nov 20, 2023
Kazuma Iwamoto, Tsubasa Ochiai, Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki, Shigeru Katagiri

Figure 1 for How does end-to-end speech recognition training impact speech enhancement artifacts?
Figure 2 for How does end-to-end speech recognition training impact speech enhancement artifacts?
Viaarxiv icon

Neural network-based virtual microphone estimation with virtual microphone and beamformer-level multi-task loss

Add code
Bookmark button
Alert button
Nov 20, 2023
Hanako Segawa, Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Rintaro Ikeshita, Shoko Araki, Takeshi Yamada, Shoji Makino

Figure 1 for Neural network-based virtual microphone estimation with virtual microphone and beamformer-level multi-task loss
Figure 2 for Neural network-based virtual microphone estimation with virtual microphone and beamformer-level multi-task loss
Figure 3 for Neural network-based virtual microphone estimation with virtual microphone and beamformer-level multi-task loss
Viaarxiv icon

Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data

Add code
Bookmark button
Alert button
May 25, 2023
Takafumi Moriya, Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Takanori Ashihara, Kohei Matsuura, Tomohiro Tanaka, Ryo Masumura, Atsunori Ogawa, Taichi Asami

Figure 1 for Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data
Figure 2 for Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data
Figure 3 for Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data
Figure 4 for Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data
Viaarxiv icon

Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss

Add code
Bookmark button
Alert button
May 24, 2023
Hiroshi Sato, Ryo Masumura, Tsubasa Ochiai, Marc Delcroix, Takafumi Moriya, Takanori Ashihara, Kentaro Shinayama, Saki Mizuno, Mana Ihori, Tomohiro Tanaka, Nobukatsu Hojo

Figure 1 for Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss
Figure 2 for Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss
Figure 3 for Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss
Figure 4 for Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss
Viaarxiv icon

Neural Target Speech Extraction: An Overview

Add code
Bookmark button
Alert button
Jan 31, 2023
Katerina Zmolikova, Marc Delcroix, Tsubasa Ochiai, Keisuke Kinoshita, Jan Černocký, Dong Yu

Figure 1 for Neural Target Speech Extraction: An Overview
Figure 2 for Neural Target Speech Extraction: An Overview
Figure 3 for Neural Target Speech Extraction: An Overview
Figure 4 for Neural Target Speech Extraction: An Overview
Viaarxiv icon

Streaming Target-Speaker ASR with Neural Transducer

Add code
Bookmark button
Alert button
Sep 19, 2022
Takafumi Moriya, Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Takahiro Shinozaki

Figure 1 for Streaming Target-Speaker ASR with Neural Transducer
Figure 2 for Streaming Target-Speaker ASR with Neural Transducer
Figure 3 for Streaming Target-Speaker ASR with Neural Transducer
Figure 4 for Streaming Target-Speaker ASR with Neural Transducer
Viaarxiv icon