Picture for Ning Cheng

Ning Cheng

RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis

Add code
May 27, 2024
Figure 1 for RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis
Figure 2 for RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis
Figure 3 for RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis
Figure 4 for RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis
Viaarxiv icon

Transformer in Touch: A Survey

Add code
May 21, 2024
Viaarxiv icon

Potential and Limitations of LLMs in Capturing Structured Semantics: A Case Study on SRL

Add code
May 10, 2024
Viaarxiv icon

MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion

Add code
May 02, 2024
Figure 1 for MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion
Figure 2 for MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion
Figure 3 for MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion
Figure 4 for MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion
Viaarxiv icon

Learning Expressive Disentangled Speech Representations with Soft Speech Units and Adversarial Style Augmentation

Add code
May 01, 2024
Viaarxiv icon

EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning

Add code
Apr 30, 2024
Figure 1 for EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning
Figure 2 for EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning
Figure 3 for EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning
Figure 4 for EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning
Viaarxiv icon

EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization

Add code
Apr 30, 2024
Viaarxiv icon

CONTUNER: Singing Voice Beautifying with Pitch and Expressiveness Condition

Add code
Apr 30, 2024
Viaarxiv icon

QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering

Add code
Apr 30, 2024
Figure 1 for QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering
Figure 2 for QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering
Figure 3 for QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering
Figure 4 for QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering
Viaarxiv icon

Towards Comprehensive Multimodal Perception: Introducing the Touch-Language-Vision Dataset

Add code
Mar 14, 2024
Figure 1 for Towards Comprehensive Multimodal Perception: Introducing the Touch-Language-Vision Dataset
Figure 2 for Towards Comprehensive Multimodal Perception: Introducing the Touch-Language-Vision Dataset
Figure 3 for Towards Comprehensive Multimodal Perception: Introducing the Touch-Language-Vision Dataset
Figure 4 for Towards Comprehensive Multimodal Perception: Introducing the Touch-Language-Vision Dataset
Viaarxiv icon