Tomoki Hayashi

ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit

Apr 11, 2023
Brian Yan, Jiatong Shi, Yun Tang, Hirofumi Inaguma, Yifan Peng, Siddharth Dalmia, Peter Polák, Patrick Fernandes, Dan Berrebbi, Tomoki Hayashi, Xiaohui Zhang, Zhaoheng Ni, Moto Hira, Soumi Maiti, Juan Pino, Shinji Watanabe

Via arXiv

Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study

Jan 26, 2023
Massa Baali, Tomoki Hayashi, Hamdy Mubarak, Soumi Maiti, Shinji Watanabe, Wassim El-Hajj, Ahmed Ali

Via arXiv

ESPnet-ONNX: Bridging a Gap Between Research and Production

Sep 20, 2022
Masao Someki, Yosuke Higuchi, Tomoki Hayashi, Shinji Watanabe

Via arXiv

A Comparative Study of Self-supervised Speech Representation Based Voice Conversion

Jul 10, 2022
Wen-Chin Huang, Shu-Wen Yang, Tomoki Hayashi, Tomoki Toda

Via arXiv

Improvement of Serial Approach to Anomalous Sound Detection by Incorporating Two Binary Cross-Entropies for Outlier Exposure

Jun 13, 2022
Ibuki Kuroyanagi, Tomoki Hayashi, Kazuya Takeda, Tomoki Toda

Via arXiv

Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis

May 09, 2022
Jiatong Shi, Shuai Guo, Tao Qian, Nan Huo, Tomoki Hayashi, Yuning Wu, Frank Xu, Xuankai Chang, Huazhe Li, Peter Wu, Shinji Watanabe, Qin Jin

Via arXiv

Acoustic Event Detection with Classifier Chains

Feb 17, 2022
Tatsuya Komatsu, Shinji Watanabe, Koichi Miyazaki, Tomoki Hayashi

Via arXiv

Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem

Jan 09, 2022
Jing Shi, Xuankai Chang, Tomoki Hayashi, Yen-Ju Lu, Shinji Watanabe, Bo Xu

Via arXiv

ViCE: Self-Supervised Visual Concept Embeddings as Contextual and Pixel Appearance Invariant Semantic Representations

Nov 24, 2021
Robin Karlsson, Tomoki Hayashi, Keisuke Fujii, Alexander Carballo, Kento Ohtani, Kazuya Takeda

Via arXiv