Improvement of Serial Approach to Anomalous Sound Detection by Incorporating Two Binary Cross-Entropies for Outlier Exposure



Ibuki Kuroyanagi, Tomoki Hayashi, Kazuya Takeda, Tomoki Toda

* 5 pages, 3 figures, 3 tables, EUSIPCO 2022 


Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation



Reo Yoneyama, Yi-Chiao Wu, Tomoki Toda

* Submitted to INTERSPEECH 2022 


Investigating Self-supervised Pretraining Frameworks for Pathological Speech Recognition



Lester Phillip Violeta, Wen-Chin Huang, Tomoki Toda

* Submitted to INTERSPEECH 2022 


The VoiceMOS Challenge 2022



Wen-Chin Huang, Erica Cooper, Yu Tsao, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi

* Submitted to INTERSPEECH 2022


Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion



Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing, Wen-Chin Huang, Tomoki Toda



HASA-net: A non-intrusive hearing-aid speech assessment network



Hsin-Tien Chiang, Yi-Chiao Wu, Cheng Yu, Tomoki Toda, Hsin-Min Wang, Yih-Chun Hu, Yu Tsao



LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech



Wen-Chin Huang, Erica Cooper, Junichi Yamagishi, Tomoki Toda

* Submitted to ICASSP 2022. Code available at: https://github.com/unilight/LDNet 


Generalization Ability of MOS Prediction Networks



Erica Cooper, Wen-Chin Huang, Tomoki Toda, Junichi Yamagishi

* Submitted to ICASSP 2022 


Towards Identity Preserving Normal to Dysarthric Voice Conversion



Wen-Chin Huang, Bence Mark Halpern, Lester Phillip Violeta, Odette Scharenborg, Tomoki Toda

* Submitted to ICASSP 2022 


S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations



Wen-Chin Huang, Shu-Wen Yang, Tomoki Hayashi, Hung-Yi Lee, Shinji Watanabe, Tomoki Toda

* Submitted to ICASSP 2022. Code available at: https://github.com/s3prl/s3prl/tree/master/s3prl/downstream/a2o-vc-vcc2020 

