NAUTILUS: a Versatile Voice Cloning System

May 22, 2020
Hieu-Thi Luong, Junichi Yamagishi

* Submitted to The IEEE/ACM Transactions on Audio, Speech, and Language Processing 

  Access Model/Code and Paper
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis

May 20, 2020
Yusuke Yasuda, Xin Wang, Junichi Yamagishi


  Access Model/Code and Paper
Design Choices for X-vector Based Speaker Anonymization

May 18, 2020
Brij Mohan Lal Srivastava, Natalia Tomashenko, Xin Wang, Emmanuel Vincent, Junichi Yamagishi, Mohamed Maouche, Aurélien Bellet, Marc Tommasi


  Access Model/Code and Paper
Introducing the VoicePrivacy Initiative

May 13, 2020
Natalia Tomashenko, Brij Mohan Lal Srivastava, Xin Wang, Emmanuel Vincent, Andreas Nautsch, Junichi Yamagishi, Nicholas Evans, Jose Patino, Jean-François Bonastre, Paul-Gauthier Noé, Massimiliano Todisco

* Submitted to Interspeech 2020 

  Access Model/Code and Paper
An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning

Feb 06, 2020
Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen, Junichi Yamagishi

* Code available at https://github.com/Miffyli/asv-cm-reinforce 

  Access Model/Code and Paper
Detecting and Correcting Adversarial Images Using Image Processing Operations

Dec 30, 2019
Huy H. Nguyen, Minoru Kuribayashi, Junichi Yamagishi, Isao Echizen

* Fixing incorrect results by removing the CNN detector part 

  Access Model/Code and Paper
Detecting and Correcting Adversarial Images Using Image Processing Operations and Convolutional Neural Networks

Dec 11, 2019
Huy H. Nguyen, Minoru Kuribayashi, Junichi Yamagishi, Isao Echizen


  Access Model/Code and Paper
Transferring neural speech waveform synthesizers to musical instrument sounds generation

Nov 19, 2019
Yi Zhao, Xin Wang, Lauri Juvela, Junichi Yamagishi

* Submitted to ICASSP 2020 

  Access Model/Code and Paper
Security of Facial Forensics Models Against Adversarial Attacks

Nov 02, 2019
Rong Huang, Fuming Fang, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen

* Submitted to ICASSP 2020 

  Access Model/Code and Paper
A Method for Identifying Origin of Digital Images Using a Convolution Neural Network

Nov 02, 2019
Rong Huang, Fuming Fang, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen

* Submitted to ICASSP 2020 

  Access Model/Code and Paper
Use of a Capsule Network to Detect Fake Images and Videos

Oct 29, 2019
Huy H. Nguyen, Junichi Yamagishi, Isao Echizen

* Fixing Table 2's scale 

  Access Model/Code and Paper
Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment

Oct 28, 2019
Yusuke Yasuda, Xin Wang, Junichi Yamagishi

* Submitted to ICASSP 2020 

  Access Model/Code and Paper
Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech

Sep 14, 2019
Hieu-Thi Luong, Junichi Yamagishi

* Accepted for IEEE ASRU 2019 

  Access Model/Code and Paper
Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments

Aug 30, 2019
Yusuke Yasuda, Xin Wang, Junichi Yamagishi

* To be appeared at SSW10 

  Access Model/Code and Paper
Generating Sentiment-Preserving Fake Online Reviews Using Neural Language Models and Their Human- and Machine-based Detection

Jul 22, 2019
David Ifeoluwa Adelani, Haotian Mai, Fuming Fang, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen

* Submitted to the IEEE International Workshop on Information Forensics and Security (WIFS) 

  Access Model/Code and Paper
A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation

Jun 18, 2019
Hieu-Thi Luong, Junichi Yamagishi

* Submitted to IEEE/ACM TASLP 

  Access Model/Code and Paper
Multi-task Learning For Detecting and Segmenting Manipulated Facial Images and Videos

Jun 17, 2019
Huy H. Nguyen, Fuming Fang, Junichi Yamagishi, Isao Echizen

* Accepted to be Published in Proceedings of the IEEE International Conference on Biometrics: Theory, Applications and Systems (BTAS) 2019, Florida, USA 

  Access Model/Code and Paper
Speaker Anonymization Using X-vector and Neural Waveform Models

May 30, 2019
Fuming Fang, Xin Wang, Junichi Yamagishi, Isao Echizen, Massimiliano Todisco, Nicholas Evans, Jean-Francois Bonastre

* Submitted to the 10th ISCA Speech Synthesis Workshop (SSW10) 

  Access Model/Code and Paper
Neural source-filter waveform models for statistical parametric speech synthesis

Apr 27, 2019
Xin Wang, Shinji Takaki, Junichi Yamagishi

* Submitted to IEEE/ACM TASLP 

  Access Model/Code and Paper
MOSNet: Deep Learning based Objective Assessment for Voice Conversion

Apr 26, 2019
Chen-Chou Lo, Szu-Wei Fu, Wen-Chin Huang, Xin Wang, Junichi Yamagishi, Yu Tsao, Hsin-Min Wang

* Submitted to Interspeech2019 

  Access Model/Code and Paper
GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-spectrogram

Apr 10, 2019
Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku

* Submitted to Interspeech 2019; fixed typo in title 

  Access Model/Code and Paper
Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet

Apr 07, 2019
Mingyang Zhang, Xin Wang, Fuming Fang, Haizhou Li, Junichi Yamagishi

* Submitted to Interspeech 2019, Graz, Austria 

  Access Model/Code and Paper
Training a Neural Speech Waveform Model using Spectral Losses of Short-Time Fourier Transform and Continuous Wavelet Transform

Apr 07, 2019
Shinji Takaki, Hirokazu Kameoka, Junichi Yamagishi

* Submitted to Interspeech 2019, Graz, Austria 

  Access Model/Code and Paper
Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora

Apr 07, 2019
Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa

* Submitted to Interspeech 2019, Graz, Austria 

  Access Model/Code and Paper
Introduction to Voice Presentation Attack Detection and Recent Advances

Jan 04, 2019
Md Sahidullah, Hector Delgado, Massimiliano Todisco, Tomi Kinnunen, Nicholas Evans, Junichi Yamagishi, Kong-Aik Lee

* Published in Handbook of Biometric Anti-Spoofing Presentation Attack Detection (Second Edition eBook ISBN 978-3-319-92627-8), 2019 
* Published as a book-chapter in Handbook of Biometric Anti-Spoofing Presentation Attack Detection (Second Edition) 

  Access Model/Code and Paper
Identifying Computer-Translated Paragraphs using Coherence Features

Dec 28, 2018
Hoang-Quoc Nguyen-Son, Ngoc-Dung T. Tieu, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen

* 9 pages, PACLIC 2018 

  Access Model/Code and Paper
Attentive Filtering Networks for Audio Replay Attack Detection

Oct 31, 2018
Cheng-I Lai, Alberto Abad, Korin Richmond, Junichi Yamagishi, Najim Dehak, Simon King

* Submitted to ICASSP 2019 

  Access Model/Code and Paper