Alert button
Picture for Zhenyao Zhu

Zhenyao Zhu

Alert button

Fully Supervised Speaker Diarization

Oct 27, 2018
Aonan Zhang, Quan Wang, Zhenyao Zhu, John Paisley, Chong Wang

Figure 1 for Fully Supervised Speaker Diarization
Figure 2 for Fully Supervised Speaker Diarization
Figure 3 for Fully Supervised Speaker Diarization
Figure 4 for Fully Supervised Speaker Diarization
Viaarxiv icon

Principled Hybrids of Generative and Discriminative Domain Adaptation

Oct 27, 2017
Han Zhao, Zhenyao Zhu, Junjie Hu, Adam Coates, Geoff Gordon

Figure 1 for Principled Hybrids of Generative and Discriminative Domain Adaptation
Figure 2 for Principled Hybrids of Generative and Discriminative Domain Adaptation
Figure 3 for Principled Hybrids of Generative and Discriminative Domain Adaptation
Figure 4 for Principled Hybrids of Generative and Discriminative Domain Adaptation
Viaarxiv icon

Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling

Aug 12, 2017
Hairong Liu, Zhenyao Zhu, Xiangang Li, Sanjeev Satheesh

Figure 1 for Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling
Figure 2 for Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling
Figure 3 for Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling
Figure 4 for Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling
Viaarxiv icon

Exploring Neural Transducers for End-to-End Speech Recognition

Jul 24, 2017
Eric Battenberg, Jitong Chen, Rewon Child, Adam Coates, Yashesh Gaur, Yi Li, Hairong Liu, Sanjeev Satheesh, David Seetapun, Anuroop Sriram, Zhenyao Zhu

Figure 1 for Exploring Neural Transducers for End-to-End Speech Recognition
Figure 2 for Exploring Neural Transducers for End-to-End Speech Recognition
Figure 3 for Exploring Neural Transducers for End-to-End Speech Recognition
Figure 4 for Exploring Neural Transducers for End-to-End Speech Recognition
Viaarxiv icon

Reducing Bias in Production Speech Models

May 11, 2017
Eric Battenberg, Rewon Child, Adam Coates, Christopher Fougner, Yashesh Gaur, Jiaji Huang, Heewoo Jun, Ajay Kannan, Markus Kliegl, Atul Kumar, Hairong Liu, Vinay Rao, Sanjeev Satheesh, David Seetapun, Anuroop Sriram, Zhenyao Zhu

Figure 1 for Reducing Bias in Production Speech Models
Figure 2 for Reducing Bias in Production Speech Models
Figure 3 for Reducing Bias in Production Speech Models
Figure 4 for Reducing Bias in Production Speech Models
Viaarxiv icon

Deep Speaker: an End-to-End Neural Speaker Embedding System

May 05, 2017
Chao Li, Xiaokong Ma, Bing Jiang, Xiangang Li, Xuewei Zhang, Xiao Liu, Ying Cao, Ajay Kannan, Zhenyao Zhu

Figure 1 for Deep Speaker: an End-to-End Neural Speaker Embedding System
Figure 2 for Deep Speaker: an End-to-End Neural Speaker Embedding System
Figure 3 for Deep Speaker: an End-to-End Neural Speaker Embedding System
Figure 4 for Deep Speaker: an End-to-End Neural Speaker Embedding System
Viaarxiv icon

Learning Multiscale Features Directly From Waveforms

Apr 05, 2016
Zhenyao Zhu, Jesse H. Engel, Awni Hannun

Figure 1 for Learning Multiscale Features Directly From Waveforms
Figure 2 for Learning Multiscale Features Directly From Waveforms
Figure 3 for Learning Multiscale Features Directly From Waveforms
Figure 4 for Learning Multiscale Features Directly From Waveforms
Viaarxiv icon

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

Dec 08, 2015
Dario Amodei, Rishita Anubhai, Eric Battenberg, Carl Case, Jared Casper, Bryan Catanzaro, Jingdong Chen, Mike Chrzanowski, Adam Coates, Greg Diamos, Erich Elsen, Jesse Engel, Linxi Fan, Christopher Fougner, Tony Han, Awni Hannun, Billy Jun, Patrick LeGresley, Libby Lin, Sharan Narang, Andrew Ng, Sherjil Ozair, Ryan Prenger, Jonathan Raiman, Sanjeev Satheesh, David Seetapun, Shubho Sengupta, Yi Wang, Zhiqian Wang, Chong Wang, Bo Xiao, Dani Yogatama, Jun Zhan, Zhenyao Zhu

Figure 1 for Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Figure 2 for Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Figure 3 for Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Figure 4 for Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Viaarxiv icon

DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection

Sep 11, 2014
Wanli Ouyang, Ping Luo, Xingyu Zeng, Shi Qiu, Yonglong Tian, Hongsheng Li, Shuo Yang, Zhe Wang, Yuanjun Xiong, Chen Qian, Zhenyao Zhu, Ruohui Wang, Chen-Change Loy, Xiaogang Wang, Xiaoou Tang

Figure 1 for DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection
Figure 2 for DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection
Figure 3 for DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection
Figure 4 for DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection
Viaarxiv icon

Deep Learning Multi-View Representation for Face Recognition

Jun 26, 2014
Zhenyao Zhu, Ping Luo, Xiaogang Wang, Xiaoou Tang

Figure 1 for Deep Learning Multi-View Representation for Face Recognition
Figure 2 for Deep Learning Multi-View Representation for Face Recognition
Figure 3 for Deep Learning Multi-View Representation for Face Recognition
Figure 4 for Deep Learning Multi-View Representation for Face Recognition
Viaarxiv icon