Alert button
Picture for Shoko Araki

Shoko Araki

Alert button

Target Speech Extraction with Pre-trained Self-supervised Learning Models

Add code
Bookmark button
Alert button
Feb 17, 2024
Junyi Peng, Marc Delcroix, Tsubasa Ochiai, Oldrich Plchot, Shoko Araki, Jan Cernocky

Viaarxiv icon

Probing Self-supervised Learning Models with Target Speech Extraction

Add code
Bookmark button
Alert button
Feb 17, 2024
Junyi Peng, Marc Delcroix, Tsubasa Ochiai, Oldrich Plchot, Takanori Ashihara, Shoko Araki, Jan Cernocky

Viaarxiv icon

Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers

Add code
Bookmark button
Alert button
Feb 05, 2024
Marvin Tammen, Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Shoko Araki, Simon Doclo

Viaarxiv icon

Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models

Add code
Bookmark button
Alert button
Dec 20, 2023
Atsunori Ogawa, Naohiro Tawara, Marc Delcroix, Shoko Araki

Viaarxiv icon

How does end-to-end speech recognition training impact speech enhancement artifacts?

Add code
Bookmark button
Alert button
Nov 20, 2023
Kazuma Iwamoto, Tsubasa Ochiai, Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki, Shigeru Katagiri

Viaarxiv icon

Neural network-based virtual microphone estimation with virtual microphone and beamformer-level multi-task loss

Add code
Bookmark button
Alert button
Nov 20, 2023
Hanako Segawa, Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Rintaro Ikeshita, Shoko Araki, Takeshi Yamada, Shoji Makino

Viaarxiv icon

Modified Parametric Multichannel Wiener Filter \\for Low-latency Enhancement of Speech Mixtures with Unknown Number of Speakers

Add code
Bookmark button
Alert button
Jun 29, 2023
Ning Guo, Tomohiro Nakatani, Shoko Araki, Takehiro Moriya

Figure 1 for Modified Parametric Multichannel Wiener Filter \\for Low-latency Enhancement of Speech Mixtures with Unknown Number of Speakers
Figure 2 for Modified Parametric Multichannel Wiener Filter \\for Low-latency Enhancement of Speech Mixtures with Unknown Number of Speakers
Figure 3 for Modified Parametric Multichannel Wiener Filter \\for Low-latency Enhancement of Speech Mixtures with Unknown Number of Speakers
Figure 4 for Modified Parametric Multichannel Wiener Filter \\for Low-latency Enhancement of Speech Mixtures with Unknown Number of Speakers
Viaarxiv icon

Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization

Add code
Bookmark button
Alert button
May 23, 2023
Marc Delcroix, Naohiro Tawara, Mireia Diez, Federico Landini, Anna Silnova, Atsunori Ogawa, Tomohiro Nakatani, Lukas Burget, Shoko Araki

Figure 1 for Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization
Figure 2 for Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization
Figure 3 for Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization
Viaarxiv icon

ConceptBeam: Concept Driven Target Speech Extraction

Add code
Bookmark button
Alert button
Jul 25, 2022
Yasunori Ohishi, Marc Delcroix, Tsubasa Ochiai, Shoko Araki, Daiki Takeuchi, Daisuke Niizumi, Akisato Kimura, Noboru Harada, Kunio Kashino

Figure 1 for ConceptBeam: Concept Driven Target Speech Extraction
Figure 2 for ConceptBeam: Concept Driven Target Speech Extraction
Figure 3 for ConceptBeam: Concept Driven Target Speech Extraction
Figure 4 for ConceptBeam: Concept Driven Target Speech Extraction
Viaarxiv icon

Mask-based Neural Beamforming for Moving Speakers with Self-Attention-based Tracking

Add code
Bookmark button
Alert button
May 07, 2022
Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Shoko Araki

Figure 1 for Mask-based Neural Beamforming for Moving Speakers with Self-Attention-based Tracking
Figure 2 for Mask-based Neural Beamforming for Moving Speakers with Self-Attention-based Tracking
Figure 3 for Mask-based Neural Beamforming for Moving Speakers with Self-Attention-based Tracking
Figure 4 for Mask-based Neural Beamforming for Moving Speakers with Self-Attention-based Tracking
Viaarxiv icon