Yixuan Zhou

Spontaneous Style Text-to-Speech Synthesis with Controllable Spontaneous Behaviors Based on Language Models

Jul 18, 2024

The THU-HCSI Multi-Speaker Multi-Lingual Few-Shot Voice Cloning System for LIMMITS'24 Challenge

Apr 25, 2024

BatchNorm-based Weakly Supervised Video Anomaly Detection

Nov 26, 2023

X-HRNet: Towards Lightweight Human Pose Estimation with Spatially Unidimensional Self-Attention

Oct 12, 2023

Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts

Sep 22, 2023

Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis

Aug 31, 2023

MSFlow: Multi-Scale Flow-based Framework for Unsupervised Anomaly Detection

Aug 29, 2023

ImbSAM: A Closer Look at Sharpness-Aware Minimization in Class-Imbalanced Recognition

Aug 15, 2023

MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context Information for Expressive Speech Synthesis

Jul 29, 2023

AnoOnly: Semi-Supervised Anomaly Detection without Loss on Normal Data

May 30, 2023