Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes



Shaojin Ding , Weiran Wang , Ding Zhao , Tara N. Sainath , Yanzhang He , Robert David , Rami Botros , Xin Wang , Rina Panigrahy , Qiao Liang , Dongseong Hwang , Ian McGraw , Rohit Prabhavalkar , Trevor Strohman

* Submitted to INTERSPEECH 2022 

   Access Paper or Ask Questions

Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition



Shaojin Ding , Rajeev Rikhye , Qiao Liang , Yanzhang He , Quan Wang , Arun Narayanan , Tom O'Malley , Ian McGraw

* Submitted to INTERSPEECH 2022 

   Access Paper or Ask Questions

4-bit Conformer with Native Quantization Aware Training for Speech Recognition



Shaojin Ding , Phoenix Meadowlark , Yanzhang He , Lukasz Lew , Shivani Agrawal , Oleg Rybakov

* Submitted to INTERSPEECH 2022 

   Access Paper or Ask Questions

Towards Lifelong Learning of Multilingual Text-To-Speech Synthesis



Mu Yang , Shaojin Ding , Tianlong Chen , Tong Wang , Zhangyang Wang

* Submitted to ICASSP 2022 

   Access Paper or Ask Questions

Textual Echo Cancellation



Shaojin Ding , Ye Jia , Ke Hu , Quan Wang


   Access Paper or Ask Questions

AutoSpeech: Neural Architecture Search for Speaker Recognition



Shaojin Ding , Tianlong Chen , Xinyu Gong , Weiwei Zha , Zhangyang Wang


   Access Paper or Ask Questions

Personal VAD: Speaker-Conditioned Voice Activity Detection



Shaojin Ding , Quan Wang , Shuo-yiin Chang , Li Wan , Ignacio Lopez Moreno

* To be submitted to ICASSP 2020 

   Access Paper or Ask Questions

ABD-Net: Attentive but Diverse Person Re-Identification



Tianlong Chen , Shaojin Ding , Jingyi Xie , Ye Yuan , Wuyang Chen , Yang Yang , Zhou Ren , Zhangyang Wang

* ICCV2019 

   Access Paper or Ask Questions