Ruoyi Zhang

Fish Audio S2 Technical Report

Mar 11, 2026

Physics Encoded Spatial and Temporal Generative Adversarial Network for Tropical Cyclone Image Super-resolution

Feb 19, 2026

FCPE: A Fast Context-based Pitch Estimation Model

Sep 18, 2025

Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection

Apr 19, 2025

Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis

Nov 02, 2024