Alert button
Picture for Dong Yu

Dong Yu

Alert button

Bayes risk CTC: Controllable CTC alignment in Sequence-to-Sequence tasks

Add code
Bookmark button
Alert button
Oct 14, 2022
Jinchuan Tian, Brian Yan, Jianwei Yu, Chao Weng, Dong Yu, Shinji Watanabe

Figure 1 for Bayes risk CTC: Controllable CTC alignment in Sequence-to-Sequence tasks
Figure 2 for Bayes risk CTC: Controllable CTC alignment in Sequence-to-Sequence tasks
Figure 3 for Bayes risk CTC: Controllable CTC alignment in Sequence-to-Sequence tasks
Figure 4 for Bayes risk CTC: Controllable CTC alignment in Sequence-to-Sequence tasks
Viaarxiv icon

Cross-Lingual Speaker Identification Using Distant Supervision

Add code
Bookmark button
Alert button
Oct 11, 2022
Ben Zhou, Dian Yu, Dong Yu, Dan Roth

Figure 1 for Cross-Lingual Speaker Identification Using Distant Supervision
Figure 2 for Cross-Lingual Speaker Identification Using Distant Supervision
Figure 3 for Cross-Lingual Speaker Identification Using Distant Supervision
Figure 4 for Cross-Lingual Speaker Identification Using Distant Supervision
Viaarxiv icon

Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks

Add code
Bookmark button
Alert button
Oct 01, 2022
Zhenhailong Wang, Xiaoman Pan, Dian Yu, Dong Yu, Jianshu Chen, Heng Ji

Figure 1 for Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks
Figure 2 for Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks
Figure 3 for Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks
Figure 4 for Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks
Viaarxiv icon

C3-DINO: Joint Contrastive and Non-contrastive Self-Supervised Learning for Speaker Verification

Add code
Bookmark button
Alert button
Aug 15, 2022
Chunlei Zhang, Dong Yu

Figure 1 for C3-DINO: Joint Contrastive and Non-contrastive Self-Supervised Learning for Speaker Verification
Figure 2 for C3-DINO: Joint Contrastive and Non-contrastive Self-Supervised Learning for Speaker Verification
Figure 3 for C3-DINO: Joint Contrastive and Non-contrastive Self-Supervised Learning for Speaker Verification
Figure 4 for C3-DINO: Joint Contrastive and Non-contrastive Self-Supervised Learning for Speaker Verification
Viaarxiv icon

Diffsound: Discrete Diffusion Model for Text-to-sound Generation

Add code
Bookmark button
Alert button
Jul 20, 2022
Dongchao Yang, Jianwei Yu, Helin Wang, Wen Wang, Chao Weng, Yuexian Zou, Dong Yu

Figure 1 for Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Figure 2 for Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Figure 3 for Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Figure 4 for Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Viaarxiv icon

Hierarchical Context Tagging for Utterance Rewriting

Add code
Bookmark button
Alert button
Jun 22, 2022
Lisa Jin, Linfeng Song, Lifeng Jin, Dong Yu, Daniel Gildea

Figure 1 for Hierarchical Context Tagging for Utterance Rewriting
Figure 2 for Hierarchical Context Tagging for Utterance Rewriting
Figure 3 for Hierarchical Context Tagging for Utterance Rewriting
Figure 4 for Hierarchical Context Tagging for Utterance Rewriting
Viaarxiv icon

Automatic Prosody Annotation with Pre-Trained Text-Speech Model

Add code
Bookmark button
Alert button
Jun 16, 2022
Ziqian Dai, Jianwei Yu, Yan Wang, Nuo Chen, Yanyao Bian, Guangzhi Li, Deng Cai, Dong Yu

Figure 1 for Automatic Prosody Annotation with Pre-Trained Text-Speech Model
Figure 2 for Automatic Prosody Annotation with Pre-Trained Text-Speech Model
Figure 3 for Automatic Prosody Annotation with Pre-Trained Text-Speech Model
Figure 4 for Automatic Prosody Annotation with Pre-Trained Text-Speech Model
Viaarxiv icon

UTTS: Unsupervised TTS with Conditional Disentangled Sequential Variational Auto-encoder

Add code
Bookmark button
Alert button
Jun 07, 2022
Jiachen Lian, Chunlei Zhang, Gopala Krishna Anumanchipalli, Dong Yu

Figure 1 for UTTS: Unsupervised TTS with Conditional Disentangled Sequential Variational Auto-encoder
Figure 2 for UTTS: Unsupervised TTS with Conditional Disentangled Sequential Variational Auto-encoder
Figure 3 for UTTS: Unsupervised TTS with Conditional Disentangled Sequential Variational Auto-encoder
Figure 4 for UTTS: Unsupervised TTS with Conditional Disentangled Sequential Variational Auto-encoder
Viaarxiv icon

LAE: Language-Aware Encoder for Monolingual and Multilingual ASR

Add code
Bookmark button
Alert button
Jun 05, 2022
Jinchuan Tian, Jianwei Yu, Chunlei Zhang, Chao Weng, Yuexian Zou, Dong Yu

Figure 1 for LAE: Language-Aware Encoder for Monolingual and Multilingual ASR
Figure 2 for LAE: Language-Aware Encoder for Monolingual and Multilingual ASR
Figure 3 for LAE: Language-Aware Encoder for Monolingual and Multilingual ASR
Figure 4 for LAE: Language-Aware Encoder for Monolingual and Multilingual ASR
Viaarxiv icon

NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement

Add code
Bookmark button
Alert button
May 20, 2022
Meng Yu, Yong Xu, Chunlei Zhang, Shi-Xiong Zhang, Dong Yu

Figure 1 for NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement
Figure 2 for NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement
Figure 3 for NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement
Figure 4 for NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement
Viaarxiv icon