Luyao Cheng

Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization

Aug 22, 2024
Via arXiv

Self-Distillation Prototypes Network: Learning Robust Speaker Representations without Supervision

Jun 17, 2024
Via arXiv

ERes2NetV2: Boosting Short-Duration Speaker Verification Performance with Computational Efficiency

Jun 04, 2024
Via arXiv

3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker Verification and Diarization

Mar 29, 2024
Via arXiv

Improving Speaker Diarization using Semantic Information: Joint Pairwise Constraints Propagation

Sep 19, 2023
Via arXiv

3D-Speaker: A Large-Scale Multi-Device, Multi-Distance, and Multi-Dialect Corpus for Speech Representation Disentanglement

Jun 28, 2023
Via arXiv

An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification

May 22, 2023
Via arXiv

Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker Diarization

May 22, 2023
Via arXiv

CAM++: A Fast and Efficient Network for Speaker Verification Using Context-Aware Masking

Mar 02, 2023
Via arXiv

Pushing the limits of self-supervised speaker verification using regularized distillation framework

Nov 08, 2022
Via arXiv