Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Zhiyong Wu

Transformer-S2A: Robust and Efficient Speech-to-Animation


Nov 18, 2021
Liyang Chen, Zhiyong Wu, Jun Ling, Runnan Li, Xu Tan, Sheng Zhao

* Submitted to ICASSP 2022 

  Access Paper or Ask Questions

An Approach to Mispronunciation Detection and Diagnosis with Acoustic, Phonetic and Linguistic (APL) Embeddings


Oct 14, 2021
Wenxuan Ye, Shaoguang Mao, Frank Soong, Wenshan Wu, Yan Xia, Jonathan Tien, Zhiyong Wu


  Access Paper or Ask Questions

Learning from Multiple Noisy Augmented Data Sets for Better Cross-Lingual Spoken Language Understanding


Sep 03, 2021
Yingmei Guo, Linjun Shou, Jian Pei, Ming Gong, Mingxing Xu, Zhiyong Wu, Daxin Jiang

* Long paper at EMNLP 2021 

  Access Paper or Ask Questions

VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis


Jul 07, 2021
Hui Lu, Zhiyong Wu, Xixin Wu, Xu Li, Shiyin Kang, Xunying Liu, Helen Meng


  Access Paper or Ask Questions

Spotting adversarial samples for speaker verification by neural vocoders


Jul 02, 2021
Haibin Wu, Po-chun Hsu, Ji Gao, Shanshan Zhang, Shen Huang, Jian Kang, Zhiyong Wu, Helen Meng, Hung-yi Lee

* Submitted to ASRU 2021 

  Access Paper or Ask Questions

Voting for the right answer: Adversarial defense for speaker verification


Jun 15, 2021
Haibin Wu, Yang Zhang, Zhiyong Wu, Dong Wang, Hung-yi Lee

* Accepted by Interspeech 2021. Code is available at https://github.com/thuhcsi/adsv_voting 

  Access Paper or Ask Questions

Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning


Jun 14, 2021
Haibin Wu, Xu Li, Andy T. Liu, Zhiyong Wu, Helen Meng, Hung-yi Lee

* Submitted to TASLP on 19 April 2021 

  Access Paper or Ask Questions

Spoken Style Learning with Multi-modal Hierarchical Context Encoding for Conversational Text-to-Speech Synthesis


Jun 11, 2021
Jingbei Li, Yi Meng, Chenyi Li, Zhiyong Wu, Helen Meng, Chao Weng, Dan Su


  Access Paper or Ask Questions

Adversarial Defense for Automatic Speaker Verification by Self-Supervised Learning


Jun 01, 2021
Haibin Wu, Xu Li, Andy T. Liu, Zhiyong Wu, Helen Meng, Hung-yi Lee

* Submitted to TASLP on 03 May 2021 

  Access Paper or Ask Questions

Cascaded Head-colliding Attention


May 31, 2021
Lin Zheng, Zhiyong Wu, Lingpeng Kong

* ACL 2021 Camera-ready version 

  Access Paper or Ask Questions

Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation


May 30, 2021
Zhiyong Wu, Lingpeng Kong, Wei Bi, Xiang Li, Ben Kao

* To appear at ACL 2021 main conference 

  Access Paper or Ask Questions

Dependency Parsing based Semantic Representation Learning with Graph Neural Network for Enhancing Expressiveness of Text-to-Speech


Apr 20, 2021
Yixuan Zhou, Changhe Song, Jingbei Li, Zhiyong Wu, Helen Meng

* 5 pages, submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

Towards Multi-Scale Style Control for Expressive Speech Synthesis


Apr 08, 2021
Xiang Li, Changhe Song, Jingbei Li, Zhiyong Wu, Jia Jia, Helen Meng

* 5 pages, 4 figures, submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

The Multi-speaker Multi-style Voice Cloning Challenge 2021


Apr 05, 2021
Qicong Xie, Xiaohai Tian, Guanghou Liu, Kun Song, Lei Xie, Zhiyong Wu, Hai Li, Song Shi, Haizhou Li, Fen Hong, Hui Bu, Xin Xu

* has been accepted to ICASSP 2021 

  Access Paper or Ask Questions

Adversarial defense for automatic speaker verification by cascaded self-supervised learning models


Feb 14, 2021
Haibin Wu, Xu Li, Andy T. Liu, Zhiyong Wu, Helen Meng, Hung-yi Lee

* Accepted to ICASSP 2021 

  Access Paper or Ask Questions

Adversarially learning disentangled speech representations for robust multi-factor voice conversion


Jan 30, 2021
Jie Wang, Jingbei Li, Xintao Zhao, Zhiyong Wu, Helen Meng


  Access Paper or Ask Questions

Unsupervised Cross-Lingual Speech Emotion Recognition Using DomainAdversarial Neural Network


Dec 21, 2020
Xiong Cai, Zhiyong Wu, Kuo Zhong, Bin Su, Dongyang Dai, Helen Meng

* This paper has been accepted by ISCSLP2021 

  Access Paper or Ask Questions

Syntactic representation learning for neural network based TTS with syntactic parse tree traversal


Dec 13, 2020
Changhe Song, Jingbei Li, Yixuan Zhou, Zhiyong Wu, Helen Meng

* This paper was submitted to ICASSP2021 

  Access Paper or Ask Questions

Non-Autoregressive Transformer ASR with CTC-Enhanced Decoder Input


Oct 28, 2020
Xingchen Song, Zhiyong Wu, Yiheng Huang, Chao Weng, Dan Su, Helen Meng

* submitted to ICASSP 2021 

  Access Paper or Ask Questions

Improving pronunciation assessment via ordinal regression with anchored reference samples


Oct 26, 2020
Bin Su, Shaoguang Mao, Frank Soong, Yan Xia, Jonathan Tien, Zhiyong Wu


  Access Paper or Ask Questions

Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams


Jun 20, 2020
Huirong Huang, Zhiyong Wu, Shiyin Kang, Dongyang Dai, Jia Jia, Tianxiao Fu, Deyi Tuo, Guangzhi Lei, Peng Liu, Dan Su, Dong Yu, Helen Meng

* 5 pages, 5 figures 

  Access Paper or Ask Questions

Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement


May 26, 2020
Dongyang Dai, Li Chen, Yuping Wang, Mu Wang, Rui Xia, Xuchen Song, Zhiyong Wu, Yuxuan Wang


  Access Paper or Ask Questions

Perturbed Masking: Parameter-free Probing for Analyzing and Interpreting BERT


Apr 30, 2020
Zhiyong Wu, Yun Chen, Ben Kao, Qun Liu

* Accepted to ACL2020 as a long paper 

  Access Paper or Ask Questions

Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention Networks


Oct 23, 2019
Xingchen Song, Guangsen Wang, Zhiyong Wu, Yiheng Huang, Dan Su, Dong Yu, Helen Meng

* \c{opyright} 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works 

  Access Paper or Ask Questions

Study on Feature Subspace of Archetypal Emotions for Speech Emotion Recognition


Nov 17, 2016
Xi Ma, Zhiyong Wu, Jia Jia, Mingxing Xu, Helen Meng, Lianhong Cai

* 5 pages, 4 figures, ICASSP-2017 

  Access Paper or Ask Questions

Feature Learning with Gaussian Restricted Boltzmann Machine for Robust Speech Recognition


Sep 23, 2013
Xin Zheng, Zhiyong Wu, Helen Meng, Weifeng Li, Lianhong Cai

* 4 pages, 2 figures 

  Access Paper or Ask Questions