Picture for Yuanchao Li

Yuanchao Li

Exploring Gender Bias in Alzheimer's Disease Detection: Insights from Mandarin and Greek Speech Perception

Add code
Jul 16, 2025
Figure 1 for Exploring Gender Bias in Alzheimer's Disease Detection: Insights from Mandarin and Greek Speech Perception
Figure 2 for Exploring Gender Bias in Alzheimer's Disease Detection: Insights from Mandarin and Greek Speech Perception
Figure 3 for Exploring Gender Bias in Alzheimer's Disease Detection: Insights from Mandarin and Greek Speech Perception
Figure 4 for Exploring Gender Bias in Alzheimer's Disease Detection: Insights from Mandarin and Greek Speech Perception
Viaarxiv icon

Leveraging Cascaded Binary Classification and Multimodal Fusion for Dementia Detection through Spontaneous Speech

Add code
May 26, 2025
Figure 1 for Leveraging Cascaded Binary Classification and Multimodal Fusion for Dementia Detection through Spontaneous Speech
Figure 2 for Leveraging Cascaded Binary Classification and Multimodal Fusion for Dementia Detection through Spontaneous Speech
Figure 3 for Leveraging Cascaded Binary Classification and Multimodal Fusion for Dementia Detection through Spontaneous Speech
Viaarxiv icon

Segmentation-Variant Codebooks for Preservation of Paralinguistic and Prosodic Information

Add code
May 21, 2025
Viaarxiv icon

Revisiting Acoustic Similarity in Emotional Speech and Music via Self-Supervised Representations

Add code
Sep 26, 2024
Figure 1 for Revisiting Acoustic Similarity in Emotional Speech and Music via Self-Supervised Representations
Figure 2 for Revisiting Acoustic Similarity in Emotional Speech and Music via Self-Supervised Representations
Figure 3 for Revisiting Acoustic Similarity in Emotional Speech and Music via Self-Supervised Representations
Figure 4 for Revisiting Acoustic Similarity in Emotional Speech and Music via Self-Supervised Representations
Viaarxiv icon

Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling

Add code
Sep 25, 2024
Figure 1 for Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling
Figure 2 for Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling
Figure 3 for Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling
Figure 4 for Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling
Viaarxiv icon

Cross-lingual Speech Emotion Recognition: Humans vs. Self-Supervised Models

Add code
Sep 25, 2024
Figure 1 for Cross-lingual Speech Emotion Recognition: Humans vs. Self-Supervised Models
Figure 2 for Cross-lingual Speech Emotion Recognition: Humans vs. Self-Supervised Models
Figure 3 for Cross-lingual Speech Emotion Recognition: Humans vs. Self-Supervised Models
Figure 4 for Cross-lingual Speech Emotion Recognition: Humans vs. Self-Supervised Models
Viaarxiv icon

Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition

Add code
Sep 17, 2024
Figure 1 for Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
Figure 2 for Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
Figure 3 for Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
Figure 4 for Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
Viaarxiv icon

Can Textual Semantics Mitigate Sounding Object Segmentation Preference?

Add code
Jul 15, 2024
Figure 1 for Can Textual Semantics Mitigate Sounding Object Segmentation Preference?
Figure 2 for Can Textual Semantics Mitigate Sounding Object Segmentation Preference?
Figure 3 for Can Textual Semantics Mitigate Sounding Object Segmentation Preference?
Figure 4 for Can Textual Semantics Mitigate Sounding Object Segmentation Preference?
Viaarxiv icon

Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques

Add code
Jun 12, 2024
Figure 1 for Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques
Figure 2 for Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques
Figure 3 for Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques
Figure 4 for Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques
Viaarxiv icon

1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem

Add code
May 30, 2024
Figure 1 for 1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem
Figure 2 for 1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem
Figure 3 for 1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem
Figure 4 for 1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem
Viaarxiv icon