Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

NeuralDPS: Neural Deterministic Plus Stochastic Model with Multiband Excitation for Noise-Controllable Waveform Generation



Tao Wang , Ruibo Fu , Jiangyan Yi , Jianhua Tao , Zhengqi Wen

* 15 pages, 12 figures; Accepted to TASLP. Demo page https://hairuo55.github.io/NeuralDPS. arXiv admin note: text overlap with arXiv:1906.09573 by other authors 

   Access Paper or Ask Questions

ADD 2022: the First Audio Deep Synthesis Detection Challenge



Jiangyan Yi , Ruibo Fu , Jianhua Tao , Shuai Nie , Haoxin Ma , Chenglong Wang , Tao Wang , Zhengkun Tian , Ye Bai , Cunhang Fan , Shan Liang , Shiming Wang , Shuai Zhang , Xinrui Yan , Le Xu , Zhengqi Wen , Haizhou Li , Zheng Lian , Bin Liu

* Accepted by ICASSP 2022 

   Access Paper or Ask Questions

CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing



Tao Wang , Jiangyan Yi , Ruibo Fu , Jianhua Tao , Zhengqi Wen

* under review, 14 pages, 14 figures, demo page is available at https://hairuo55.github.io/CampNet 

   Access Paper or Ask Questions

Singing-Tacotron: Global duration control attention and dynamic filter for End-to-end singing voice synthesis



Tao Wang , Ruibo Fu , Jiangyan Yi , Jianhua Tao , Zhengqi Wen

* 5 pages, 7 figures 

   Access Paper or Ask Questions

Reducing language context confusion for end-to-end code-switching automatic speech recognition



Shuai Zhang , Jiangyan Yi , Zhengkun Tian , Jianhua Tao , Yu Ting Yeung , Liqun Deng

* arXiv admin note: text overlap with arXiv:2010.14798 

   Access Paper or Ask Questions

Continual Learning for Fake Audio Detection



Haoxin Ma , Jiangyan Yi , Jianhua Tao , Ye Bai , Zhengkun Tian , Chenglong Wang

* 5 pages, conference 

   Access Paper or Ask Questions

Half-Truth: A Partially Fake Audio Detection Dataset



Jiangyan Yi , Ye Bai , Jianhua Tao , Zhengkun Tian , Chenglong Wang , Tao Wang , Ruibo Fu

* submitted to Interspeech 2021 

   Access Paper or Ask Questions

FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization



Zhengkun Tian , Jiangyan Yi , Ye Bai , Jianhua Tao , Shuai Zhang , Zhengqi Wen

* Submitted to INTERSPEECH2021 

   Access Paper or Ask Questions

TSNAT: Two-Step Non-Autoregressvie Transformer Models for Speech Recognition



Zhengkun Tian , Jiangyan Yi , Jianhua Tao , Ye Bai , Shuai Zhang , Zhengqi Wen , Xuefei Liu

* Submitted to Interspeech2021 

   Access Paper or Ask Questions

1
2
3
>>