Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS


Nov 04, 2022
Dongchao Yang, Songxiang Liu, Jianwei Yu, Helin Wang, Chao Weng, Yuexian Zou

Add code

* Submitted to ICASSP2023 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Diffsound: Discrete Diffusion Model for Text-to-sound Generation


Jul 20, 2022
Dongchao Yang, Jianwei Yu, Helin Wang, Wen Wang, Chao Weng, Yuexian Zou, Dong Yu

Add code

* Submitted to TASLP2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Calibrate and Refine! A Novel and Agile Framework for ASR-error Robust Intent Detection


May 23, 2022
Peilin Zhou, Dading Chong, Helin Wang, Qingcheng Zeng

Add code

* Submit to INTERSPEECH 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training


Apr 27, 2022
Dading Chong, Helin Wang, Peilin Zhou, Qingcheng Zeng

Add code

* Submit to INTERSPEECH 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection


Apr 05, 2022
Dongchao Yang, Helin Wang, Zhongjie Ye, Yuexian Zou, Wenwu Wang

Add code

* submitted to interspeech2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

A Two-student Learning Framework for Mixed Supervised Target Sound Detection


Apr 05, 2022
Dongchao Yang, Helin Wang, Yuexian Zou, Wenwu Wang

Add code

* submitted to interspeech2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Improving Target Sound Extraction with Timestamp Information


Apr 02, 2022
Helin Wang, Dongchao Yang, Chao Weng, Jianwei Yu, Yuexian Zou

Add code

* submitted to interspeech2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Detect what you want: Target Sound Detection


Dec 19, 2021
Dongchao Yang, Helin Wang, Yuexian Zou, Chao Weng

Add code

* Submitted to ICASSP2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information


Oct 12, 2021
Zhongjie Ye, Helin Wang, Dongchao Yang, Yuexian Zou

Add code

* 5 pages, 1 figure, accepted by DCASE 2021 workshop 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

A Mutual learning framework for Few-shot Sound Event Detection


Oct 09, 2021
Dongchao Yang, Helin Wang, Yuexian Zou, Zhongjie Ye, Wenwu Wang

Add code

* Submitted to ICASSP2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
>>