Lin-shan Lee

REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR

Feb 06, 2024

SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering

Jan 24, 2024

DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering

Mar 26, 2022

Towards Lifelong Learning of End-to-end ASR

Apr 04, 2021

FragmentVC: Any-to-Any Voice Conversion by End-to-End Extracting and Fusing Fine-Grained Voice Fragments With Attention

Oct 27, 2020

Defending Your Voice: Adversarial Attack on Voice Conversion

May 18, 2020

End-to-end Whispered Speech Recognition with Frequency-weighted Approaches and Layer-wise Transfer Learning

May 05, 2020

Sequence-to-sequence Automatic Speech Recognition with Word Embedding Regularization and Fused Decoding

Oct 28, 2019

Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation Learning

Oct 28, 2019

Interrupted and cascaded permutation invariant training for speech separation

Oct 28, 2019