Picture for Pengyuan Zhang

Pengyuan Zhang

Synthetic Speech Detection Based on Temporal Consistency and Distribution of Speaker Features

Add code
Sep 29, 2023
Figure 1 for Synthetic Speech Detection Based on Temporal Consistency and Distribution of Speaker Features
Figure 2 for Synthetic Speech Detection Based on Temporal Consistency and Distribution of Speaker Features
Figure 3 for Synthetic Speech Detection Based on Temporal Consistency and Distribution of Speaker Features
Figure 4 for Synthetic Speech Detection Based on Temporal Consistency and Distribution of Speaker Features
Viaarxiv icon

The Impact of Silence on Speech Anti-Spoofing

Add code
Sep 21, 2023
Figure 1 for The Impact of Silence on Speech Anti-Spoofing
Figure 2 for The Impact of Silence on Speech Anti-Spoofing
Figure 3 for The Impact of Silence on Speech Anti-Spoofing
Figure 4 for The Impact of Silence on Speech Anti-Spoofing
Viaarxiv icon

Improving Short Utterance Anti-Spoofing with AASIST2

Add code
Sep 15, 2023
Figure 1 for Improving Short Utterance Anti-Spoofing with AASIST2
Figure 2 for Improving Short Utterance Anti-Spoofing with AASIST2
Figure 3 for Improving Short Utterance Anti-Spoofing with AASIST2
Figure 4 for Improving Short Utterance Anti-Spoofing with AASIST2
Viaarxiv icon

One-Class Knowledge Distillation for Spoofing Speech Detection

Add code
Sep 15, 2023
Figure 1 for One-Class Knowledge Distillation for Spoofing Speech Detection
Figure 2 for One-Class Knowledge Distillation for Spoofing Speech Detection
Figure 3 for One-Class Knowledge Distillation for Spoofing Speech Detection
Viaarxiv icon

Expressive paragraph text-to-speech synthesis with multi-step variational autoencoder

Add code
Sep 02, 2023
Figure 1 for Expressive paragraph text-to-speech synthesis with multi-step variational autoencoder
Figure 2 for Expressive paragraph text-to-speech synthesis with multi-step variational autoencoder
Figure 3 for Expressive paragraph text-to-speech synthesis with multi-step variational autoencoder
Figure 4 for Expressive paragraph text-to-speech synthesis with multi-step variational autoencoder
Viaarxiv icon

Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition

Add code
Aug 12, 2023
Figure 1 for Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Figure 2 for Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Figure 3 for Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Figure 4 for Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Viaarxiv icon

Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture

Add code
Jul 05, 2023
Figure 1 for Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture
Figure 2 for Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture
Figure 3 for Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture
Figure 4 for Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture
Viaarxiv icon

The HCCL system for VoxCeleb Speaker Recognition Challenge 2022

Add code
May 22, 2023
Figure 1 for The HCCL system for VoxCeleb Speaker Recognition Challenge 2022
Figure 2 for The HCCL system for VoxCeleb Speaker Recognition Challenge 2022
Figure 3 for The HCCL system for VoxCeleb Speaker Recognition Challenge 2022
Figure 4 for The HCCL system for VoxCeleb Speaker Recognition Challenge 2022
Viaarxiv icon

Progressive Sub-Graph Clustering Algorithm for Semi-Supervised Domain Adaptation Speaker Verification

Add code
May 22, 2023
Viaarxiv icon

ForkNet: Simultaneous Time and Time-Frequency Domain Modeling for Speech Enhancement

Add code
May 15, 2023
Viaarxiv icon