Zixing Zhang

DEBATE: A Dataset for Disentangling Textual Ambiguity in Mandarin Through Speech

Jun 09, 2025
Via arXiv

ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis

Dec 16, 2024
Via arXiv

ParaLBench: A Large-Scale Benchmark for Computational Paralinguistics over Acoustic Foundation Models

Nov 14, 2024
Via arXiv

Re-Parameterization of Lightweight Transformer for On-Device Speech Emotion Recognition

Nov 14, 2024
Via arXiv

Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling

Sep 25, 2024
Via arXiv

DaRec: A Disentangled Alignment Framework for Large Language Model and Recommender System

Aug 15, 2024
Via arXiv

ESIHGNN: Event-State Interactions Infused Heterogeneous Graph Neural Network for Conversational Emotion Recognition

May 07, 2024
Via arXiv

HAFFormer: A Hierarchical Attention-Free Framework for Alzheimer's Disease Detection From Spontaneous Speech

May 07, 2024
Via arXiv

Intelligent Cardiac Auscultation for Murmur Detection via Parallel-Attentive Models with Uncertainty Estimation

May 07, 2024
Via arXiv

Adaptive Speech Emotion Representation Learning Based On Dynamic Graph

May 07, 2024
Via arXiv