Picture for Shujie Liu

Shujie Liu

Separating Long-Form Speech with Group-Wise Permutation Invariant Training

Add code
Nov 17, 2021
Figure 1 for Separating Long-Form Speech with Group-Wise Permutation Invariant Training
Figure 2 for Separating Long-Form Speech with Group-Wise Permutation Invariant Training
Figure 3 for Separating Long-Form Speech with Group-Wise Permutation Invariant Training
Figure 4 for Separating Long-Form Speech with Group-Wise Permutation Invariant Training
Viaarxiv icon

WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing

Add code
Oct 29, 2021
Figure 1 for WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Figure 2 for WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Figure 3 for WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Figure 4 for WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Viaarxiv icon

Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction

Add code
Oct 28, 2021
Figure 1 for Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction
Figure 2 for Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction
Figure 3 for Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction
Figure 4 for Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction
Viaarxiv icon

Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding

Add code
Oct 23, 2021
Figure 1 for Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding
Figure 2 for Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding
Figure 3 for Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding
Figure 4 for Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding
Viaarxiv icon

SpeechT5: Unified-Modal Encoder-Decoder Pre-training for Spoken Language Processing

Add code
Oct 14, 2021
Figure 1 for SpeechT5: Unified-Modal Encoder-Decoder Pre-training for Spoken Language Processing
Figure 2 for SpeechT5: Unified-Modal Encoder-Decoder Pre-training for Spoken Language Processing
Figure 3 for SpeechT5: Unified-Modal Encoder-Decoder Pre-training for Spoken Language Processing
Figure 4 for SpeechT5: Unified-Modal Encoder-Decoder Pre-training for Spoken Language Processing
Viaarxiv icon

Large-scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification

Add code
Oct 12, 2021
Figure 1 for Large-scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification
Figure 2 for Large-scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification
Figure 3 for Large-scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification
Figure 4 for Large-scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification
Viaarxiv icon

UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training

Add code
Oct 12, 2021
Figure 1 for UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training
Figure 2 for UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training
Figure 3 for UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training
Figure 4 for UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training
Viaarxiv icon

Multi-View Self-Attention Based Transformer for Speaker Recognition

Add code
Oct 11, 2021
Figure 1 for Multi-View Self-Attention Based Transformer for Speaker Recognition
Figure 2 for Multi-View Self-Attention Based Transformer for Speaker Recognition
Figure 3 for Multi-View Self-Attention Based Transformer for Speaker Recognition
Figure 4 for Multi-View Self-Attention Based Transformer for Speaker Recognition
Viaarxiv icon

Jointly Learning to Repair Code and Generate Commit Message

Add code
Sep 25, 2021
Figure 1 for Jointly Learning to Repair Code and Generate Commit Message
Figure 2 for Jointly Learning to Repair Code and Generate Commit Message
Figure 3 for Jointly Learning to Repair Code and Generate Commit Message
Figure 4 for Jointly Learning to Repair Code and Generate Commit Message
Viaarxiv icon

Knowledge Enhanced Fine-Tuning for Better Handling Unseen Entities in Dialogue Generation

Add code
Sep 12, 2021
Figure 1 for Knowledge Enhanced Fine-Tuning for Better Handling Unseen Entities in Dialogue Generation
Figure 2 for Knowledge Enhanced Fine-Tuning for Better Handling Unseen Entities in Dialogue Generation
Figure 3 for Knowledge Enhanced Fine-Tuning for Better Handling Unseen Entities in Dialogue Generation
Figure 4 for Knowledge Enhanced Fine-Tuning for Better Handling Unseen Entities in Dialogue Generation
Viaarxiv icon