Jing Xiao

Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training

May 22, 2024
Via arXiv

Listen, Disentangle, and Control: Controllable Speech-Driven Talking Head Generation

May 12, 2024
Via arXiv

MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion

May 02, 2024
Via arXiv

Learning Expressive Disentangled Speech Representations with Soft Speech Units and Adversarial Style Augmentation

May 01, 2024
Via arXiv

CONTUNER: Singing Voice Beautifying with Pitch and Expressiveness Condition

Apr 30, 2024
Via arXiv

EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning

Apr 30, 2024
Via arXiv

EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization

Apr 30, 2024
Via arXiv

QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering

Apr 30, 2024
Via arXiv

Efficient Multi-Model Fusion with Adversarial Complementary Representation Learning

Apr 24, 2024
Via arXiv

Retrieval-Augmented Audio Deepfake Detection

Apr 23, 2024
Via arXiv