Alert button
Picture for Takuya Yoshioka

Takuya Yoshioka

Alert button

Factual Consistency Oriented Speech Recognition

Add code
Bookmark button
Alert button
Feb 24, 2023
Naoyuki Kanda, Takuya Yoshioka, Yang Liu

Figure 1 for Factual Consistency Oriented Speech Recognition
Figure 2 for Factual Consistency Oriented Speech Recognition
Figure 3 for Factual Consistency Oriented Speech Recognition
Figure 4 for Factual Consistency Oriented Speech Recognition
Viaarxiv icon

Exploring WavLM on Speech Enhancement

Add code
Bookmark button
Alert button
Nov 18, 2022
Hyungchan Song, Sanyuan Chen, Zhuo Chen, Yu Wu, Takuya Yoshioka, Min Tang, Jong Won Shin, Shujie Liu

Figure 1 for Exploring WavLM on Speech Enhancement
Figure 2 for Exploring WavLM on Speech Enhancement
Figure 3 for Exploring WavLM on Speech Enhancement
Viaarxiv icon

Simulating realistic speech overlaps improves multi-talker ASR

Add code
Bookmark button
Alert button
Nov 17, 2022
Muqiao Yang, Naoyuki Kanda, Xiaofei Wang, Jian Wu, Sunit Sivasankaran, Zhuo Chen, Jinyu Li, Takuya Yoshioka

Figure 1 for Simulating realistic speech overlaps improves multi-talker ASR
Figure 2 for Simulating realistic speech overlaps improves multi-talker ASR
Figure 3 for Simulating realistic speech overlaps improves multi-talker ASR
Figure 4 for Simulating realistic speech overlaps improves multi-talker ASR
Viaarxiv icon

Real-Time Target Sound Extraction

Add code
Bookmark button
Alert button
Nov 14, 2022
Bandhav Veluri, Justin Chan, Malek Itani, Tuochao Chen, Takuya Yoshioka, Shyamnath Gollakota

Figure 1 for Real-Time Target Sound Extraction
Figure 2 for Real-Time Target Sound Extraction
Figure 3 for Real-Time Target Sound Extraction
Figure 4 for Real-Time Target Sound Extraction
Viaarxiv icon

Breaking trade-offs in speech separation with sparsely-gated mixture of experts

Add code
Bookmark button
Alert button
Nov 11, 2022
Xiaofei Wang, Zhuo Chen, Yu Shi, Jian Wu, Naoyuki Kanda, Takuya Yoshioka

Figure 1 for Breaking trade-offs in speech separation with sparsely-gated mixture of experts
Figure 2 for Breaking trade-offs in speech separation with sparsely-gated mixture of experts
Figure 3 for Breaking trade-offs in speech separation with sparsely-gated mixture of experts
Viaarxiv icon

Self-supervised learning with bi-label masked speech prediction for streaming multi-talker speech recognition

Add code
Bookmark button
Alert button
Nov 10, 2022
Zili Huang, Zhuo Chen, Naoyuki Kanda, Jian Wu, Yiming Wang, Jinyu Li, Takuya Yoshioka, Xiaofei Wang, Peidong Wang

Figure 1 for Self-supervised learning with bi-label masked speech prediction for streaming multi-talker speech recognition
Figure 2 for Self-supervised learning with bi-label masked speech prediction for streaming multi-talker speech recognition
Figure 3 for Self-supervised learning with bi-label masked speech prediction for streaming multi-talker speech recognition
Figure 4 for Self-supervised learning with bi-label masked speech prediction for streaming multi-talker speech recognition
Viaarxiv icon

Speech separation with large-scale self-supervised learning

Add code
Bookmark button
Alert button
Nov 09, 2022
Zhuo Chen, Naoyuki Kanda, Jian Wu, Yu Wu, Xiaofei Wang, Takuya Yoshioka, Jinyu Li, Sunit Sivasankaran, Sefik Emre Eskimez

Figure 1 for Speech separation with large-scale self-supervised learning
Figure 2 for Speech separation with large-scale self-supervised learning
Figure 3 for Speech separation with large-scale self-supervised learning
Figure 4 for Speech separation with large-scale self-supervised learning
Viaarxiv icon

Breaking the trade-off in personalized speech enhancement with cross-task knowledge distillation

Add code
Bookmark button
Alert button
Nov 05, 2022
Hassan Taherian, Sefik Emre Eskimez, Takuya Yoshioka

Figure 1 for Breaking the trade-off in personalized speech enhancement with cross-task knowledge distillation
Figure 2 for Breaking the trade-off in personalized speech enhancement with cross-task knowledge distillation
Figure 3 for Breaking the trade-off in personalized speech enhancement with cross-task knowledge distillation
Viaarxiv icon

Real-Time Joint Personalized Speech Enhancement and Acoustic Echo Cancellation with E3Net

Add code
Bookmark button
Alert button
Nov 04, 2022
Sefik Emre Eskimez, Takuya Yoshioka, Alex Ju, Min Tang, Tanel Parnamaa, Huaming Wang

Figure 1 for Real-Time Joint Personalized Speech Enhancement and Acoustic Echo Cancellation with E3Net
Figure 2 for Real-Time Joint Personalized Speech Enhancement and Acoustic Echo Cancellation with E3Net
Figure 3 for Real-Time Joint Personalized Speech Enhancement and Acoustic Echo Cancellation with E3Net
Viaarxiv icon