Picture for Guoqing Zhao

Guoqing Zhao

Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation

Add code
Mar 19, 2024
Figure 1 for Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation
Figure 2 for Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation
Figure 3 for Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation
Figure 4 for Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation
Viaarxiv icon

SELM: Speech Enhancement Using Discrete Tokens and Language Models

Add code
Dec 15, 2023
Viaarxiv icon

Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning

Add code
Oct 26, 2023
Figure 1 for Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning
Figure 2 for Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning
Figure 3 for Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning
Figure 4 for Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning
Viaarxiv icon

Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification

Add code
Oct 07, 2023
Figure 1 for Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification
Figure 2 for Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification
Figure 3 for Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification
Figure 4 for Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification
Viaarxiv icon

Haha-Pod: An Attempt for Laughter-based Non-Verbal Speaker Verification

Add code
Sep 25, 2023
Figure 1 for Haha-Pod: An Attempt for Laughter-based Non-Verbal Speaker Verification
Figure 2 for Haha-Pod: An Attempt for Laughter-based Non-Verbal Speaker Verification
Figure 3 for Haha-Pod: An Attempt for Laughter-based Non-Verbal Speaker Verification
Figure 4 for Haha-Pod: An Attempt for Laughter-based Non-Verbal Speaker Verification
Viaarxiv icon

VoxBlink: X-Large Speaker Verification Dataset on Camera

Add code
Aug 23, 2023
Figure 1 for VoxBlink: X-Large Speaker Verification Dataset on Camera
Figure 2 for VoxBlink: X-Large Speaker Verification Dataset on Camera
Figure 3 for VoxBlink: X-Large Speaker Verification Dataset on Camera
Figure 4 for VoxBlink: X-Large Speaker Verification Dataset on Camera
Viaarxiv icon

The DKU-MSXF Speaker Verification System for the VoxCeleb Speaker Recognition Challenge 2023

Add code
Aug 17, 2023
Figure 1 for The DKU-MSXF Speaker Verification System for the VoxCeleb Speaker Recognition Challenge 2023
Figure 2 for The DKU-MSXF Speaker Verification System for the VoxCeleb Speaker Recognition Challenge 2023
Figure 3 for The DKU-MSXF Speaker Verification System for the VoxCeleb Speaker Recognition Challenge 2023
Figure 4 for The DKU-MSXF Speaker Verification System for the VoxCeleb Speaker Recognition Challenge 2023
Viaarxiv icon

The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023

Add code
Aug 17, 2023
Figure 1 for The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023
Figure 2 for The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023
Figure 3 for The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023
Figure 4 for The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023
Viaarxiv icon

The NPU-MSXF Speech-to-Speech Translation System for IWSLT 2023 Speech-to-Speech Translation Task

Add code
Jul 10, 2023
Figure 1 for The NPU-MSXF Speech-to-Speech Translation System for IWSLT 2023 Speech-to-Speech Translation Task
Figure 2 for The NPU-MSXF Speech-to-Speech Translation System for IWSLT 2023 Speech-to-Speech Translation Task
Figure 3 for The NPU-MSXF Speech-to-Speech Translation System for IWSLT 2023 Speech-to-Speech Translation Task
Figure 4 for The NPU-MSXF Speech-to-Speech Translation System for IWSLT 2023 Speech-to-Speech Translation Task
Viaarxiv icon

TreeMAN: Tree-enhanced Multimodal Attention Network for ICD Coding

Add code
May 29, 2023
Figure 1 for TreeMAN: Tree-enhanced Multimodal Attention Network for ICD Coding
Figure 2 for TreeMAN: Tree-enhanced Multimodal Attention Network for ICD Coding
Figure 3 for TreeMAN: Tree-enhanced Multimodal Attention Network for ICD Coding
Figure 4 for TreeMAN: Tree-enhanced Multimodal Attention Network for ICD Coding
Viaarxiv icon