Picture for Hung-Yi Lee

Hung-Yi Lee

J-Net: Randomly weighted U-Net for audio source separation

Add code
Nov 29, 2019
Figure 1 for J-Net: Randomly weighted U-Net for audio source separation
Figure 2 for J-Net: Randomly weighted U-Net for audio source separation
Figure 3 for J-Net: Randomly weighted U-Net for audio source separation
Figure 4 for J-Net: Randomly weighted U-Net for audio source separation
Viaarxiv icon

Training a code-switching language model with monolingual data

Add code
Nov 14, 2019
Figure 1 for Training a code-switching language model with monolingual data
Figure 2 for Training a code-switching language model with monolingual data
Figure 3 for Training a code-switching language model with monolingual data
Figure 4 for Training a code-switching language model with monolingual data
Viaarxiv icon

What does a network layer hear? Analyzing hidden representations of end-to-end ASR through speech synthesis

Add code
Nov 04, 2019
Figure 1 for What does a network layer hear? Analyzing hidden representations of end-to-end ASR through speech synthesis
Figure 2 for What does a network layer hear? Analyzing hidden representations of end-to-end ASR through speech synthesis
Figure 3 for What does a network layer hear? Analyzing hidden representations of end-to-end ASR through speech synthesis
Figure 4 for What does a network layer hear? Analyzing hidden representations of end-to-end ASR through speech synthesis
Viaarxiv icon

SpeechBERT: Cross-Modal Pre-trained Language Model for End-to-end Spoken Question Answering

Add code
Oct 25, 2019
Figure 1 for SpeechBERT: Cross-Modal Pre-trained Language Model for End-to-end Spoken Question Answering
Figure 2 for SpeechBERT: Cross-Modal Pre-trained Language Model for End-to-end Spoken Question Answering
Figure 3 for SpeechBERT: Cross-Modal Pre-trained Language Model for End-to-end Spoken Question Answering
Figure 4 for SpeechBERT: Cross-Modal Pre-trained Language Model for End-to-end Spoken Question Answering
Viaarxiv icon

Tree Transformer: Integrating Tree Structures into Self-Attention

Add code
Sep 14, 2019
Figure 1 for Tree Transformer: Integrating Tree Structures into Self-Attention
Figure 2 for Tree Transformer: Integrating Tree Structures into Self-Attention
Figure 3 for Tree Transformer: Integrating Tree Structures into Self-Attention
Figure 4 for Tree Transformer: Integrating Tree Structures into Self-Attention
Viaarxiv icon

Order-free Learning Alleviating Exposure Bias in Multi-label Classification

Add code
Sep 08, 2019
Figure 1 for Order-free Learning Alleviating Exposure Bias in Multi-label Classification
Figure 2 for Order-free Learning Alleviating Exposure Bias in Multi-label Classification
Figure 3 for Order-free Learning Alleviating Exposure Bias in Multi-label Classification
Figure 4 for Order-free Learning Alleviating Exposure Bias in Multi-label Classification
Viaarxiv icon

LAMAL: LAnguage Modeling Is All You Need for Lifelong Language Learning

Add code
Sep 07, 2019
Figure 1 for LAMAL: LAnguage Modeling Is All You Need for Lifelong Language Learning
Figure 2 for LAMAL: LAnguage Modeling Is All You Need for Lifelong Language Learning
Figure 3 for LAMAL: LAnguage Modeling Is All You Need for Lifelong Language Learning
Figure 4 for LAMAL: LAnguage Modeling Is All You Need for Lifelong Language Learning
Viaarxiv icon

Cross-Lingual Transfer Learning for Question Answering

Add code
Jul 13, 2019
Figure 1 for Cross-Lingual Transfer Learning for Question Answering
Figure 2 for Cross-Lingual Transfer Learning for Question Answering
Figure 3 for Cross-Lingual Transfer Learning for Question Answering
Figure 4 for Cross-Lingual Transfer Learning for Question Answering
Viaarxiv icon

Mitigating the Impact of Speech Recognition Errors on Spoken Question Answering by Adversarial Domain Adaptation

Add code
Apr 16, 2019
Figure 1 for Mitigating the Impact of Speech Recognition Errors on Spoken Question Answering by Adversarial Domain Adaptation
Figure 2 for Mitigating the Impact of Speech Recognition Errors on Spoken Question Answering by Adversarial Domain Adaptation
Figure 3 for Mitigating the Impact of Speech Recognition Errors on Spoken Question Answering by Adversarial Domain Adaptation
Figure 4 for Mitigating the Impact of Speech Recognition Errors on Spoken Question Answering by Adversarial Domain Adaptation
Viaarxiv icon

Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering

Add code
Apr 16, 2019
Figure 1 for Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Figure 2 for Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Figure 3 for Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Figure 4 for Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Viaarxiv icon