Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Haizhou Li

Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection


Jul 14, 2021
Ruijie Tao, Zexu Pan, Rohan Kumar Das, Xinyuan Qian, Mike Zheng Shou, Haizhou Li

* ACM Multimedia 2021 

  Access Paper or Ask Questions

Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding


Jul 14, 2021
Hongning Zhu, Kong Aik Lee, Haizhou Li

* Accepted by Interspeech 2021 

  Access Paper or Ask Questions

Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer


Jul 08, 2021
Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li

* Submitted to ASRU 2021 

  Access Paper or Ask Questions

Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification


Jun 17, 2021
Li Zhang, Qing Wang, Kong Aik Lee, Lei Xie, Haizhou Li


  Access Paper or Ask Questions

Selective Hearing through Lip-reading


Jun 14, 2021
Zexu Pan, Ruijie Tao, Chenglin Xu, Haizhou Li


  Access Paper or Ask Questions

DynaEval: Unifying Turn and Dialogue Level Evaluation


Jun 06, 2021
Chen Zhang, Yiming Chen, Luis Fernando D'Haro, Yan Zhang, Thomas Friedrichs, Grandee Lee, Haizhou Li

* ACL-IJCNLP 2021 (Main conference, Long paper) 

  Access Paper or Ask Questions

Emotional Voice Conversion: Theory, Databases and ESD


May 31, 2021
Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li

* Submitted to Speech Communication 

  Access Paper or Ask Questions

Multi-target DoA Estimation with an Audio-visual Fusion Mechanism


May 13, 2021
Xinyuan Qian, Maulik Madhavi, Zexu Pan, Jiadong Wang, Haizhou Li

* ICASSP 2021 accepted 

  Access Paper or Ask Questions

The Multi-speaker Multi-style Voice Cloning Challenge 2021


Apr 05, 2021
Qicong Xie, Xiaohai Tian, Guanghou Liu, Kun Song, Lei Xie, Zhiyong Wu, Hai Li, Song Shi, Haizhou Li, Fen Hong, Hui Bu, Xin Xu

* has been accepted to ICASSP 2021 

  Access Paper or Ask Questions

Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability


Apr 03, 2021
Rui Liu, Berrak Sisman, Haizhou Li

* 5 pages, 4 figures, Submitted to Interspeech 2021, Speech Samples: https://ttslr.github.io/i-ETTS 

  Access Paper or Ask Questions

Target Speaker Verification with Selective Auditory Attention for Single and Multi-talker Speech


Apr 02, 2021
Chenglin Xu, Wei Rao, Jibin Wu, Haizhou Li

* 13 pages, submitted to IEEE/ACM transaction on Audio, Speech and Language on 10 Jan. 2021 

  Access Paper or Ask Questions

Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence Training


Mar 31, 2021
Kun Zhou, Berrak Sisman, Haizhou Li

* Submitted to Interspeech 2021 

  Access Paper or Ask Questions

Low-latency auditory spatial attention detection based on spectro-spatial features from EEG


Mar 05, 2021
Siqi Cai, Pengcheng Sun, Tanja Schultz, Haizhou Li

* International Conference of the IEEE Engineering in Medicine and Biology Society 

  Access Paper or Ask Questions

Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification


Feb 15, 2021
Bidisha Sharma, Maulik Madhavi, Haizhou Li


  Access Paper or Ask Questions

Data Augmentation with Signal Companding for Detection of Logical Access Attacks


Feb 12, 2021
Rohan Kumar Das, Jichen Yang, Haizhou Li

* 5 pages, Accepted for publication in International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2021 

  Access Paper or Ask Questions

Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals


Nov 19, 2020
Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li

* submit to ICASSP 2021 

  Access Paper or Ask Questions

VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech


Nov 03, 2020
Kun Zhou, Berrak Sisman, Haizhou Li

* Accepted by IEEE SLT 2021. arXiv admin note: text overlap with arXiv:2005.07025 

  Access Paper or Ask Questions

Seen and Unseen emotional style transfer for voice conversion with a new emotional speech dataset


Oct 28, 2020
Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li

* Submitted to ICASSP 2021 

  Access Paper or Ask Questions

GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis


Oct 23, 2020
Rui Liu, Berrak Sisman, Haizhou Li

* This paper was submitted to ICASSP2021 

  Access Paper or Ask Questions

Speaker-Utterance Dual Attention for Speaker and Utterance Verification


Aug 20, 2020
Tianchi Liu, Rohan Kumar Das, Maulik Madhavi, Shengmei Shen, Haizhou Li

* Accepted by Interspeech 2020 

  Access Paper or Ask Questions

Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN


Aug 12, 2020
Zongyang Du, Kun Zhou, Berrak Sisman, Haizhou Li

* Submitted to APSIPA ASC 2020 

  Access Paper or Ask Questions

VAW-GAN for Singing Voice Conversion with Non-parallel Training Data


Aug 12, 2020
Junchen Lu, Kun Zhou, Berrak Sisman, Haizhou Li

* Submitted to APSIPA ASC 2020 

  Access Paper or Ask Questions

Modeling Prosodic Phrasing with Multi-Task Learning in Tacotron-based TTS


Aug 11, 2020
Rui Liu, Berrak Sisman, Feilong Bao, Guanglai Gao, Haizhou Li

* To appear in IEEE Signal Processing Letters (SPL) 

  Access Paper or Ask Questions

Multi-Tones' Phase Coding (MTPC) of Interaural Time Difference by Spiking Neural Network


Jul 07, 2020
Zihan Pan, Malu Zhang, Jibin Wu, Haizhou Li


  Access Paper or Ask Questions

Progressive Tandem Learning for Pattern Recognition with Deep Spiking Neural Networks


Jul 02, 2020
Jibin Wu, Chenglin Xu, Daquan Zhou, Haizhou Li, Kay Chen Tan


  Access Paper or Ask Questions

You Only Spike Once: Improving Energy-Efficient Neuromorphic Inference to ANN-Level Accuracy


Jun 03, 2020
Srivatsa P, Kyle Timothy Ng Chu, Yaswanth Tavva, Jibin Wu, Malu Zhang, Haizhou Li, Trevor E. Carlson

* 10 pages, 4 figures, extended version of the paper accepted to the 2nd Workshop on Accelerated Machine Learning (AccML 2020) 

  Access Paper or Ask Questions