Picture for Ya Li

Ya Li

The Renaissance of Expert Systems: Optical Recognition of Printed Chinese Jianpu Musical Scores with Lyrics

Add code
Dec 15, 2025
Viaarxiv icon

HQ-SVC: Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios

Add code
Nov 15, 2025
Viaarxiv icon

SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding

Add code
Sep 18, 2025
Figure 1 for SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding
Figure 2 for SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding
Figure 3 for SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding
Figure 4 for SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding
Viaarxiv icon

ViRefSAM: Visual Reference-Guided Segment Anything Model for Remote Sensing Segmentation

Add code
Jul 03, 2025
Viaarxiv icon

MGFF-TDNN: A Multi-Granularity Feature Fusion TDNN Model with Depth-Wise Separable Module for Speaker Verification

Add code
May 06, 2025
Viaarxiv icon

Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling

Add code
Mar 05, 2025
Figure 1 for Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling
Figure 2 for Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling
Figure 3 for Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling
Figure 4 for Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling
Viaarxiv icon

Psy-Copilot: Visual Chain of Thought for Counseling

Add code
Mar 05, 2025
Viaarxiv icon

Mel-Refine: A Plug-and-Play Approach to Refine Mel-Spectrogram in Audio Generation

Add code
Dec 11, 2024
Viaarxiv icon

Enhancing Modal Fusion by Alignment and Label Matching for Multimodal Emotion Recognition

Add code
Aug 18, 2024
Viaarxiv icon

SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion

Add code
Jun 09, 2024
Figure 1 for SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion
Figure 2 for SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion
Figure 3 for SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion
Figure 4 for SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion
Viaarxiv icon