Picture for Wenbin Wang

Wenbin Wang

GLOBE: A High-quality English Corpus with Global Accents for Zero-shot Speaker Adaptive Text-to-Speech

Add code
Jun 21, 2024
Figure 1 for GLOBE: A High-quality English Corpus with Global Accents for Zero-shot Speaker Adaptive Text-to-Speech
Figure 2 for GLOBE: A High-quality English Corpus with Global Accents for Zero-shot Speaker Adaptive Text-to-Speech
Figure 3 for GLOBE: A High-quality English Corpus with Global Accents for Zero-shot Speaker Adaptive Text-to-Speech
Figure 4 for GLOBE: A High-quality English Corpus with Global Accents for Zero-shot Speaker Adaptive Text-to-Speech
Viaarxiv icon

Scene Graph Generation in Large-Size VHR Satellite Imagery: A Large-Scale Dataset and A Context-Aware Approach

Add code
Jun 13, 2024
Figure 1 for Scene Graph Generation in Large-Size VHR Satellite Imagery: A Large-Scale Dataset and A Context-Aware Approach
Figure 2 for Scene Graph Generation in Large-Size VHR Satellite Imagery: A Large-Scale Dataset and A Context-Aware Approach
Figure 3 for Scene Graph Generation in Large-Size VHR Satellite Imagery: A Large-Scale Dataset and A Context-Aware Approach
Figure 4 for Scene Graph Generation in Large-Size VHR Satellite Imagery: A Large-Scale Dataset and A Context-Aware Approach
Viaarxiv icon

USAT: A Universal Speaker-Adaptive Text-to-Speech Approach

Add code
Apr 28, 2024
Viaarxiv icon

Principled Preferential Bayesian Optimization

Add code
Feb 08, 2024
Viaarxiv icon

WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge

Add code
Jan 12, 2024
Figure 1 for WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge
Figure 2 for WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge
Figure 3 for WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge
Figure 4 for WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge
Viaarxiv icon

Dynamic Association Learning of Self-Attention and Convolution in Image Restoration

Add code
Nov 09, 2023
Figure 1 for Dynamic Association Learning of Self-Attention and Convolution in Image Restoration
Viaarxiv icon

Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations

Add code
Aug 24, 2023
Figure 1 for Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations
Figure 2 for Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations
Figure 3 for Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations
Figure 4 for Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations
Viaarxiv icon

Super-resolution of Ray-tracing Channel Simulation via Attention Mechanism based Deep Learning Model

Add code
Jan 21, 2023
Viaarxiv icon

Pose-disentangled Contrastive Learning for Self-supervised Facial Representation

Add code
Nov 24, 2022
Figure 1 for Pose-disentangled Contrastive Learning for Self-supervised Facial Representation
Figure 2 for Pose-disentangled Contrastive Learning for Self-supervised Facial Representation
Figure 3 for Pose-disentangled Contrastive Learning for Self-supervised Facial Representation
Figure 4 for Pose-disentangled Contrastive Learning for Self-supervised Facial Representation
Viaarxiv icon

AutoLV: Automatic Lecture Video Generator

Add code
Sep 19, 2022
Figure 1 for AutoLV: Automatic Lecture Video Generator
Figure 2 for AutoLV: Automatic Lecture Video Generator
Figure 3 for AutoLV: Automatic Lecture Video Generator
Figure 4 for AutoLV: Automatic Lecture Video Generator
Viaarxiv icon