Picture for Helen Meng

Helen Meng

UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization

Add code
Jan 26, 2024
Figure 1 for UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization
Figure 2 for UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization
Figure 3 for UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization
Figure 4 for UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization
Viaarxiv icon

SCNet: Sparse Compression Network for Music Source Separation

Add code
Jan 24, 2024
Figure 1 for SCNet: Sparse Compression Network for Music Source Separation
Figure 2 for SCNet: Sparse Compression Network for Music Source Separation
Figure 3 for SCNet: Sparse Compression Network for Music Source Separation
Figure 4 for SCNet: Sparse Compression Network for Music Source Separation
Viaarxiv icon

Multi-view MidiVAE: Fusing Track- and Bar-view Representations for Long Multi-track Symbolic Music Generation

Add code
Jan 15, 2024
Figure 1 for Multi-view MidiVAE: Fusing Track- and Bar-view Representations for Long Multi-track Symbolic Music Generation
Figure 2 for Multi-view MidiVAE: Fusing Track- and Bar-view Representations for Long Multi-track Symbolic Music Generation
Figure 3 for Multi-view MidiVAE: Fusing Track- and Bar-view Representations for Long Multi-track Symbolic Music Generation
Figure 4 for Multi-view MidiVAE: Fusing Track- and Bar-view Representations for Long Multi-track Symbolic Music Generation
Viaarxiv icon

Cross-Speaker Encoding Network for Multi-Talker Speech Recognition

Add code
Jan 08, 2024
Figure 1 for Cross-Speaker Encoding Network for Multi-Talker Speech Recognition
Figure 2 for Cross-Speaker Encoding Network for Multi-Talker Speech Recognition
Figure 3 for Cross-Speaker Encoding Network for Multi-Talker Speech Recognition
Figure 4 for Cross-Speaker Encoding Network for Multi-Talker Speech Recognition
Viaarxiv icon

Consistent and Relevant: Rethink the Query Embedding in General Sound Separation

Add code
Dec 24, 2023
Figure 1 for Consistent and Relevant: Rethink the Query Embedding in General Sound Separation
Figure 2 for Consistent and Relevant: Rethink the Query Embedding in General Sound Separation
Figure 3 for Consistent and Relevant: Rethink the Query Embedding in General Sound Separation
Figure 4 for Consistent and Relevant: Rethink the Query Embedding in General Sound Separation
Viaarxiv icon

StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis

Add code
Dec 19, 2023
Viaarxiv icon

SimCalib: Graph Neural Network Calibration based on Similarity between Nodes

Add code
Dec 19, 2023
Viaarxiv icon

neural concatenative singing voice conversion: rethinking concatenation-based approach for one-shot singing voice conversion

Add code
Dec 08, 2023
Viaarxiv icon

Injecting linguistic knowledge into BERT for Dialogue State Tracking

Add code
Nov 27, 2023
Figure 1 for Injecting linguistic knowledge into BERT for Dialogue State Tracking
Figure 2 for Injecting linguistic knowledge into BERT for Dialogue State Tracking
Figure 3 for Injecting linguistic knowledge into BERT for Dialogue State Tracking
Figure 4 for Injecting linguistic knowledge into BERT for Dialogue State Tracking
Viaarxiv icon

DASA: Difficulty-Aware Semantic Augmentation for Speaker Verification

Add code
Oct 18, 2023
Figure 1 for DASA: Difficulty-Aware Semantic Augmentation for Speaker Verification
Figure 2 for DASA: Difficulty-Aware Semantic Augmentation for Speaker Verification
Figure 3 for DASA: Difficulty-Aware Semantic Augmentation for Speaker Verification
Figure 4 for DASA: Difficulty-Aware Semantic Augmentation for Speaker Verification
Viaarxiv icon