Xiaofei Wang

Unified Modeling Enhanced Multimodal Learning for Precision Neuro-Oncology

Jun 11, 2024

An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS

Jun 09, 2024

Total-Duration-Aware Duration Modeling for Text-to-Speech Systems

Jun 06, 2024

Domain Game: Disentangle Anatomical Feature for Single Domain Generalized Segmentation

Jun 04, 2024

TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation

May 28, 2024

Socialized Learning: A Survey of the Paradigm Shift for Edge Intelligence in Networked Systems

Apr 20, 2024

Cross-modal Diffusion Modelling for Super-resolved Spatial Transcriptomics

Apr 19, 2024

CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations

Apr 10, 2024

Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object Detection

Mar 12, 2024

Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like

Feb 12, 2024