
Dong Yu


Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition

Jun 08, 2021
Max W. Y. Lam, Jun Wang, Chao Weng, Dan Su, Dong Yu

Via arXiv

Latency-Controlled Neural Architecture Search for Streaming Speech Recognition

May 08, 2021
Liqiang He, Shulin Feng, Dan Su, Dong Yu

Via arXiv

SpeechMoE: Scaling to Large Acoustic Models with Dynamic Routing Mixture of Experts

May 07, 2021
Zhao You, Shulin Feng, Dan Su, Dong Yu

Via arXiv

Video-aided Unsupervised Grammar Induction

May 04, 2021
Songyang Zhang, Linfeng Song, Lifeng Jin, Kun Xu, Dong Yu, Jiebo Luo

Via arXiv

MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation

Apr 26, 2021
Xiyun Li, Yong Xu, Meng Yu, Shi-Xiong Zhang, Jiaming Xu, Bo Xu, Dong Yu

Via arXiv

Complex Neural Spatial Filter: Enhancing Multi-channel Target Speech Separation in Complex Domain

Apr 26, 2021
Rongzhi Gu, Shi-Xiong Zhang, Yuexian Zou, Dong Yu

Via arXiv

Conversational Semantic Role Labeling

Apr 11, 2021
Kun Xu, Han Wu, Linfeng Song, Haisong Zhang, Linqi Song, Dong Yu

Via arXiv

MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment

Apr 02, 2021
Meng Yu, Chunlei Zhang, Yong Xu, Shixiong Zhang, Dong Yu

Via arXiv