Alert button
Picture for Youzheng Wu

Youzheng Wu

Alert button

Leveraging Label Information for Multimodal Emotion Recognition

Sep 05, 2023
Peiying Wang, Sunlu Zeng, Junqing Chen, Lu Fan, Meng Chen, Youzheng Wu, Xiaodong He

Figure 1 for Leveraging Label Information for Multimodal Emotion Recognition
Figure 2 for Leveraging Label Information for Multimodal Emotion Recognition
Figure 3 for Leveraging Label Information for Multimodal Emotion Recognition
Figure 4 for Leveraging Label Information for Multimodal Emotion Recognition
Viaarxiv icon

AUGUST: an Automatic Generation Understudy for Synthesizing Conversational Recommendation Datasets

Jun 16, 2023
Yu Lu, Junwei Bao, Zichen Ma, Xiaoguang Han, Youzheng Wu, Shuguang Cui, Xiaodong He

Figure 1 for AUGUST: an Automatic Generation Understudy for Synthesizing Conversational Recommendation Datasets
Figure 2 for AUGUST: an Automatic Generation Understudy for Synthesizing Conversational Recommendation Datasets
Figure 3 for AUGUST: an Automatic Generation Understudy for Synthesizing Conversational Recommendation Datasets
Figure 4 for AUGUST: an Automatic Generation Understudy for Synthesizing Conversational Recommendation Datasets
Viaarxiv icon

OTF: Optimal Transport based Fusion of Supervised and Self-Supervised Learning Models for Automatic Speech Recognition

Jun 05, 2023
Li Fu, Siqi Li, Qingtao Li, Fangzhu Li, Liping Deng, Lu Fan, Meng Chen, Youzheng Wu, Xiaodong He

Figure 1 for OTF: Optimal Transport based Fusion of Supervised and Self-Supervised Learning Models for Automatic Speech Recognition
Figure 2 for OTF: Optimal Transport based Fusion of Supervised and Self-Supervised Learning Models for Automatic Speech Recognition
Figure 3 for OTF: Optimal Transport based Fusion of Supervised and Self-Supervised Learning Models for Automatic Speech Recognition
Figure 4 for OTF: Optimal Transport based Fusion of Supervised and Self-Supervised Learning Models for Automatic Speech Recognition
Viaarxiv icon

SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation

Nov 27, 2022
Huaishao Luo, Junwei Bao, Youzheng Wu, Xiaodong He, Tianrui Li

Figure 1 for SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation
Figure 2 for SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation
Figure 3 for SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation
Figure 4 for SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation
Viaarxiv icon

Multi-Speaker Multi-Style Speech Synthesis with Timbre and Style Disentanglement

Nov 22, 2022
Wei Song, Yanghao Yue, Ya-jie Zhang, Zhengchen Zhang, Youzheng Wu, Xiaodong He

Figure 1 for Multi-Speaker Multi-Style Speech Synthesis with Timbre and Style Disentanglement
Figure 2 for Multi-Speaker Multi-Style Speech Synthesis with Timbre and Style Disentanglement
Figure 3 for Multi-Speaker Multi-Style Speech Synthesis with Timbre and Style Disentanglement
Figure 4 for Multi-Speaker Multi-Style Speech Synthesis with Timbre and Style Disentanglement
Viaarxiv icon

MaskedSpeech: Context-aware Speech Synthesis with Masking Strategy

Nov 11, 2022
Ya-Jie Zhang, Wei Song, Yanghao Yue, Zhengchen Zhang, Youzheng Wu, Xiaodong He

Figure 1 for MaskedSpeech: Context-aware Speech Synthesis with Masking Strategy
Figure 2 for MaskedSpeech: Context-aware Speech Synthesis with Masking Strategy
Figure 3 for MaskedSpeech: Context-aware Speech Synthesis with Masking Strategy
Figure 4 for MaskedSpeech: Context-aware Speech Synthesis with Masking Strategy
Viaarxiv icon

MoNET: Tackle State Momentum via Noise-Enhanced Training for Dialogue State Tracking

Nov 11, 2022
Haoning Zhang, Junwei Bao, Haipeng Sun, Youzheng Wu, Wenye Li, Shuguang Cui, Xiaodong He

Figure 1 for MoNET: Tackle State Momentum via Noise-Enhanced Training for Dialogue State Tracking
Figure 2 for MoNET: Tackle State Momentum via Noise-Enhanced Training for Dialogue State Tracking
Figure 3 for MoNET: Tackle State Momentum via Noise-Enhanced Training for Dialogue State Tracking
Figure 4 for MoNET: Tackle State Momentum via Noise-Enhanced Training for Dialogue State Tracking
Viaarxiv icon

MuGER$^2$: Multi-Granularity Evidence Retrieval and Reasoning for Hybrid Question Answering

Oct 19, 2022
Yingyao Wang, Junwei Bao, Chaoqun Duan, Youzheng Wu, Xiaodong He, Tiejun Zhao

Figure 1 for MuGER$^2$: Multi-Granularity Evidence Retrieval and Reasoning for Hybrid Question Answering
Figure 2 for MuGER$^2$: Multi-Granularity Evidence Retrieval and Reasoning for Hybrid Question Answering
Figure 3 for MuGER$^2$: Multi-Granularity Evidence Retrieval and Reasoning for Hybrid Question Answering
Figure 4 for MuGER$^2$: Multi-Granularity Evidence Retrieval and Reasoning for Hybrid Question Answering
Viaarxiv icon