Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm

Oct 21, 2020
Jennifer Williams, Yi Zhao, Erica Cooper, Junichi Yamagishi

* Submitted to ICASSP 2021 


Grapheme or phoneme? An Analysis of Tacotron's Embedded Representations

Oct 21, 2020
Antoine Perquin, Erica Cooper, Junichi Yamagishi

* Submitted to ICASSP 2021 


End-to-End Text-to-Speech using Latent Duration based on VQ-VAE

Oct 20, 2020
Yusuke Yasuda, Xin Wang, Junichi Yamagishi



Latent linguistic embedding for cross-lingual text-to-speech and voice conversion

Oct 08, 2020
Hieu-Thi Luong, Junichi Yamagishi

* Accepted to Voice Conversion Challenge 2020 Online Workshop 


Viable Threat on News Reading: Generating Biased News Using Natural Language Models

Oct 05, 2020
Saurabh Gupta, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen

* 11 pages, 4 figures, 6 tables, Accepted at NLP+CSS Workshop at EMNLP 2020 


Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals

Jul 12, 2020
Tomi Kinnunen, Héctor Delgado, Nicholas Evans, Kong Aik Lee, Ville Vestman, Andreas Nautsch, Massimiliano Todisco, Xin Wang, Md Sahidullah, Junichi Yamagishi, Douglas A. Reynolds

* Accepted for publication in IEEE/ACM Transactions on Audio, Speech, and Language Processing 


Generating Master Faces for Use in Performing Wolf Attacks on Face Recognition Systems

Jun 15, 2020
Huy H. Nguyen, Junichi Yamagishi, Isao Echizen, Sébastien Marcel

* Accepted for publication in the Proceedings of the 2020 International Joint Conference on Biometrics (IJCB 2020), Houston, USA


NAUTILUS: a Versatile Voice Cloning System

May 22, 2020
Hieu-Thi Luong, Junichi Yamagishi

* Submitted to The IEEE/ACM Transactions on Audio, Speech, and Language Processing 


Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis

May 20, 2020
Yusuke Yasuda, Xin Wang, Junichi Yamagishi



Design Choices for X-vector Based Speaker Anonymization

May 18, 2020
Brij Mohan Lal Srivastava, Natalia Tomashenko, Xin Wang, Emmanuel Vincent, Junichi Yamagishi, Mohamed Maouche, Aurélien Bellet, Marc Tommasi



Introducing the VoicePrivacy Initiative

May 13, 2020
Natalia Tomashenko, Brij Mohan Lal Srivastava, Xin Wang, Emmanuel Vincent, Andreas Nautsch, Junichi Yamagishi, Nicholas Evans, Jose Patino, Jean-François Bonastre, Paul-Gauthier Noé, Massimiliano Todisco

* Submitted to Interspeech 2020 


An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning

Feb 06, 2020
Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen, Junichi Yamagishi

* Code available at https://github.com/Miffyli/asv-cm-reinforce 


Detecting and Correcting Adversarial Images Using Image Processing Operations

Dec 30, 2019
Huy H. Nguyen, Minoru Kuribayashi, Junichi Yamagishi, Isao Echizen

* Fixing incorrect results by removing the CNN detector part 


Detecting and Correcting Adversarial Images Using Image Processing Operations and Convolutional Neural Networks

Dec 11, 2019
Huy H. Nguyen, Minoru Kuribayashi, Junichi Yamagishi, Isao Echizen



Transferring neural speech waveform synthesizers to musical instrument sounds generation

Nov 19, 2019
Yi Zhao, Xin Wang, Lauri Juvela, Junichi Yamagishi

* Submitted to ICASSP 2020 


Security of Facial Forensics Models Against Adversarial Attacks

Nov 02, 2019
Rong Huang, Fuming Fang, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen

* Submitted to ICASSP 2020 


A Method for Identifying Origin of Digital Images Using a Convolution Neural Network

Nov 02, 2019
Rong Huang, Fuming Fang, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen

* Submitted to ICASSP 2020 


Use of a Capsule Network to Detect Fake Images and Videos

Oct 29, 2019
Huy H. Nguyen, Junichi Yamagishi, Isao Echizen

* Fixing Table 2's scale 


Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment

Oct 28, 2019
Yusuke Yasuda, Xin Wang, Junichi Yamagishi

* Submitted to ICASSP 2020 


Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech

Sep 14, 2019
Hieu-Thi Luong, Junichi Yamagishi

* Accepted for IEEE ASRU 2019 


Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments

Aug 30, 2019
Yusuke Yasuda, Xin Wang, Junichi Yamagishi

* To appear at SSW10


Generating Sentiment-Preserving Fake Online Reviews Using Neural Language Models and Their Human- and Machine-based Detection

Jul 22, 2019
David Ifeoluwa Adelani, Haotian Mai, Fuming Fang, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen

* Submitted to the IEEE International Workshop on Information Forensics and Security (WIFS) 


A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation

Jun 18, 2019
Hieu-Thi Luong, Junichi Yamagishi

* Submitted to IEEE/ACM TASLP 


Multi-task Learning For Detecting and Segmenting Manipulated Facial Images and Videos

Jun 17, 2019
Huy H. Nguyen, Fuming Fang, Junichi Yamagishi, Isao Echizen

* Accepted for publication in the Proceedings of the IEEE International Conference on Biometrics: Theory, Applications and Systems (BTAS) 2019, Florida, USA


Speaker Anonymization Using X-vector and Neural Waveform Models

May 30, 2019
Fuming Fang, Xin Wang, Junichi Yamagishi, Isao Echizen, Massimiliano Todisco, Nicholas Evans, Jean-Francois Bonastre

* Submitted to the 10th ISCA Speech Synthesis Workshop (SSW10) 


Neural source-filter waveform models for statistical parametric speech synthesis

Apr 27, 2019
Xin Wang, Shinji Takaki, Junichi Yamagishi

* Submitted to IEEE/ACM TASLP 


MOSNet: Deep Learning based Objective Assessment for Voice Conversion

Apr 26, 2019
Chen-Chou Lo, Szu-Wei Fu, Wen-Chin Huang, Xin Wang, Junichi Yamagishi, Yu Tsao, Hsin-Min Wang

* Submitted to Interspeech 2019
