MOS Dataset


Prosody-Guided Harmonic Attention for Phase-Coherent Neural Vocoding in the Complex Spectrum

Jan 20, 2026

Subjective evaluation of UHD video coded using VVC with LCEVC and ML-VVC

Jan 15, 2026

Song Aesthetics Evaluation with Multi-Stem Attention and Hierarchical Uncertainty Modeling

Jan 18, 2026

Summary of The Inaugural Music Source Restoration Challenge

Jan 07, 2026

ManchuTTS: Towards High-Quality Manchu Speech Synthesis via Flow Matching and Hierarchical Text Representation

Dec 27, 2025

Effect of Activation Function and Model Optimizer on the Performance of Human Activity Recognition System Using Various Deep Learning Models

Dec 23, 2025

EchoMark: Perceptual Acoustic Environment Transfer with Watermark-Embedded Room Impulse Response

Nov 09, 2025

CAMP-VQA: Caption-Embedded Multimodal Perception for No-Reference Quality Assessment of Compressed Video

Nov 10, 2025

From Scores to Preferences: Redefining MOS Benchmarking for Speech Quality Reward Modeling

Oct 01, 2025

UltraVoice: Scaling Fine-Grained Style-Controlled Speech Conversations for Spoken Dialogue Models

Oct 26, 2025