Ning Cheng

Towards Comprehensive Multimodal Perception: Introducing the Touch-Language-Vision Dataset

Mar 14, 2024
Via arXiv

Medical Speech Symptoms Classification via Disentangled Representation

Mar 08, 2024
Via arXiv

Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

Feb 01, 2024
Via arXiv

Learning Disentangled Speech Representations with Contrastive Learning and Time-Invariant Retrieval

Jan 18, 2024
Via arXiv

Leveraging Biases in Large Language Models: "bias-kNN" for Effective Few-Shot Learning

Jan 18, 2024
Via arXiv

ED-TTS: Multi-Scale Emotion Modeling using Cross-Domain Emotion Diarization for Emotional Speech Synthesis

Jan 16, 2024
Via arXiv

EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model

Jan 16, 2024
Via arXiv

DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation

Nov 29, 2023
Via arXiv

CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation

Nov 15, 2023
Via arXiv

CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding

Nov 15, 2023
Via arXiv