Picture for Ya Li

Ya Li

The Renaissance of Expert Systems: Optical Recognition of Printed Chinese Jianpu Musical Scores with Lyrics

Add code
Dec 15, 2025
Viaarxiv icon

HQ-SVC: Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios

Add code
Nov 15, 2025
Viaarxiv icon

SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding

Add code
Sep 18, 2025
Figure 1 for SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding
Figure 2 for SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding
Figure 3 for SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding
Figure 4 for SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding
Viaarxiv icon

ViRefSAM: Visual Reference-Guided Segment Anything Model for Remote Sensing Segmentation

Add code
Jul 03, 2025
Viaarxiv icon

MGFF-TDNN: A Multi-Granularity Feature Fusion TDNN Model with Depth-Wise Separable Module for Speaker Verification

Add code
May 06, 2025
Viaarxiv icon

Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling

Add code
Mar 05, 2025
Figure 1 for Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling
Figure 2 for Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling
Figure 3 for Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling
Figure 4 for Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling
Viaarxiv icon

Psy-Copilot: Visual Chain of Thought for Counseling

Add code
Mar 05, 2025
Figure 1 for Psy-Copilot: Visual Chain of Thought for Counseling
Figure 2 for Psy-Copilot: Visual Chain of Thought for Counseling
Figure 3 for Psy-Copilot: Visual Chain of Thought for Counseling
Figure 4 for Psy-Copilot: Visual Chain of Thought for Counseling
Viaarxiv icon

Mel-Refine: A Plug-and-Play Approach to Refine Mel-Spectrogram in Audio Generation

Add code
Dec 11, 2024
Viaarxiv icon

Enhancing Modal Fusion by Alignment and Label Matching for Multimodal Emotion Recognition

Add code
Aug 18, 2024
Figure 1 for Enhancing Modal Fusion by Alignment and Label Matching for Multimodal Emotion Recognition
Figure 2 for Enhancing Modal Fusion by Alignment and Label Matching for Multimodal Emotion Recognition
Figure 3 for Enhancing Modal Fusion by Alignment and Label Matching for Multimodal Emotion Recognition
Figure 4 for Enhancing Modal Fusion by Alignment and Label Matching for Multimodal Emotion Recognition
Viaarxiv icon

SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion

Add code
Jun 09, 2024
Figure 1 for SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion
Figure 2 for SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion
Figure 3 for SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion
Figure 4 for SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion
Viaarxiv icon