Alert button
Picture for Marc Delcroix

Marc Delcroix

Alert button

Target Speech Extraction with Pre-trained Self-supervised Learning Models

Add code
Bookmark button
Alert button
Feb 17, 2024
Junyi Peng, Marc Delcroix, Tsubasa Ochiai, Oldrich Plchot, Shoko Araki, Jan Cernocky

Viaarxiv icon

Probing Self-supervised Learning Models with Target Speech Extraction

Add code
Bookmark button
Alert button
Feb 17, 2024
Junyi Peng, Marc Delcroix, Tsubasa Ochiai, Oldrich Plchot, Takanori Ashihara, Shoko Araki, Jan Cernocky

Viaarxiv icon

Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers

Add code
Bookmark button
Alert button
Feb 05, 2024
Marvin Tammen, Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Shoko Araki, Simon Doclo

Viaarxiv icon

What Do Self-Supervised Speech and Speaker Models Learn? New Findings From a Cross Model Layer-Wise Analysis

Add code
Bookmark button
Alert button
Jan 31, 2024
Takanori Ashihara, Marc Delcroix, Takafumi Moriya, Kohei Matsuura, Taichi Asami, Yusuke Ijima

Viaarxiv icon

Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters

Add code
Bookmark button
Alert button
Jan 10, 2024
Kenichi Fujita, Hiroshi Sato, Takanori Ashihara, Hiroki Kanagawa, Marc Delcroix, Takafumi Moriya, Yusuke Ijima

Viaarxiv icon

BLSTM-Based Confidence Estimation for End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Dec 22, 2023
Atsunori Ogawa, Naohiro Tawara, Takatomo Kano, Marc Delcroix

Viaarxiv icon

Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models

Add code
Bookmark button
Alert button
Dec 20, 2023
Atsunori Ogawa, Naohiro Tawara, Marc Delcroix, Shoko Araki

Viaarxiv icon

How does end-to-end speech recognition training impact speech enhancement artifacts?

Add code
Bookmark button
Alert button
Nov 20, 2023
Kazuma Iwamoto, Tsubasa Ochiai, Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki, Shigeru Katagiri

Viaarxiv icon

Neural network-based virtual microphone estimation with virtual microphone and beamformer-level multi-task loss

Add code
Bookmark button
Alert button
Nov 20, 2023
Hanako Segawa, Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Rintaro Ikeshita, Shoko Araki, Takeshi Yamada, Shoji Makino

Viaarxiv icon

Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Oct 17, 2023
Atsunori Ogawa, Takafumi Moriya, Naoyuki Kamo, Naohiro Tawara, Marc Delcroix

Viaarxiv icon