Zhiyao Duan

A Novel 1D State Space for Efficient Music Rhythmic Analysis


Nov 01, 2021
Mojtaba Heydari, Matthew McCallum, Andreas Ehmann, Zhiyao Duan

* Submitted to International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022. The source code, video demos, and user package are available in the following GitHub repository: https://github.com/mjhydri/1D-StateSpace 


A study of the robustness of raw waveform based speaker embeddings under mismatched conditions


Oct 11, 2021
Ge Zhu, Frank Cwitkowitz, Zhiyao Duan



UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021


Aug 23, 2021
Xinhui Chen, You Zhang, Ge Zhu, Zhiyao Duan

* To appear in Proc. ASVspoof 2021 Workshop 


Learning Sparse Analytic Filters for Piano Transcription


Aug 23, 2021
Frank Cwitkowitz, Mojtaba Heydari, Zhiyao Duan



BeatNet: CRNN and Particle Filtering for Online Joint Beat Downbeat and Meter Tracking


Aug 08, 2021
Mojtaba Heydari, Frank Cwitkowitz, Zhiyao Duan

* 22nd International Society for Music Information Retrieval (ISMIR) Conference Paper, Fall 2021. 8 Pages (Total), 3 Figures, 2 Tables, 1 Algorithm 


Audiovisual Singing Voice Separation


Jul 01, 2021
Bochen Li, Yuxuan Wang, Zhiyao Duan



An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems


Apr 03, 2021
You Zhang, Ge Zhu, Fei Jiang, Zhiyao Duan

* 5 pages, 6 figures, submitted to INTERSPEECH 2021 


Themes Inferred Audio-visual Correspondence Learning


Sep 14, 2020
Runze Su, Fei Tao, Xudong Liu, Haoran Wei, Xiaorong Mei, Zhiyao Duan, Lei Yuan, Ji Liu, Yuying Xie

* Submitting to ICASSP 2020 


Speech Driven Talking Face Generation from a Single Image and an Emotion Condition


Aug 08, 2020
Sefik Emre Eskimez, You Zhang, Zhiyao Duan



RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning


Feb 08, 2020
Nan Jiang, Sheng Jin, Zhiyao Duan, Changshui Zhang



Hierarchical Cross-Modal Talking Face Generation with Dynamic Pixel-Wise Loss


May 09, 2019
Lele Chen, Ross K. Maddox, Zhiyao Duan, Chenliang Xu

* Published in CVPR 2019 


Lip Movements Generation at a Glance


May 21, 2018
Lele Chen, Zhiheng Li, Ross K. Maddox, Zhiyao Duan, Chenliang Xu



Generating Talking Face Landmarks from Speech


Apr 23, 2018
Sefik Emre Eskimez, Ross K Maddox, Chenliang Xu, Zhiyao Duan

* To Appear in LVA ICA 2018. Please see the following link: http://www2.ece.rochester.edu/projects/air/projects/talkingface.html 


Audio-Visual Event Localization in Unconstrained Videos


Mar 23, 2018
Yapeng Tian, Jing Shi, Bochen Li, Zhiyao Duan, Chenliang Xu

* 23 pages, 7 figures 


Deep Cross-Modal Audio-Visual Generation


Apr 26, 2017
Lele Chen, Sudhanshu Srivastava, Zhiyao Duan, Chenliang Xu

