Picture for Yu Pan

Yu Pan

QiNN-QJ: A Quantum-inspired Neural Network with Quantum Jump for Multimodal Sentiment Analysis

Add code
Oct 31, 2025
Viaarxiv icon

S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamlessly Speech-Text Alignment and Streaming Speech Decoder

Add code
Jun 16, 2025
Figure 1 for S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamlessly Speech-Text Alignment and Streaming Speech Decoder
Figure 2 for S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamlessly Speech-Text Alignment and Streaming Speech Decoder
Viaarxiv icon

Learning and Interpreting Gravitational-Wave Features from CNNs with a Random Forest Approach

Add code
May 26, 2025
Viaarxiv icon

ClapFM-EVC: High-Fidelity and Flexible Emotional Voice Conversion with Dual Control from Natural Language and Speech

Add code
May 20, 2025
Figure 1 for ClapFM-EVC: High-Fidelity and Flexible Emotional Voice Conversion with Dual Control from Natural Language and Speech
Figure 2 for ClapFM-EVC: High-Fidelity and Flexible Emotional Voice Conversion with Dual Control from Natural Language and Speech
Figure 3 for ClapFM-EVC: High-Fidelity and Flexible Emotional Voice Conversion with Dual Control from Natural Language and Speech
Figure 4 for ClapFM-EVC: High-Fidelity and Flexible Emotional Voice Conversion with Dual Control from Natural Language and Speech
Viaarxiv icon

Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs

Add code
May 07, 2025
Figure 1 for Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs
Figure 2 for Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs
Figure 3 for Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs
Figure 4 for Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs
Viaarxiv icon

Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs

Add code
Apr 10, 2025
Viaarxiv icon

Parasite: A Steganography-based Backdoor Attack Framework for Diffusion Models

Add code
Apr 08, 2025
Figure 1 for Parasite: A Steganography-based Backdoor Attack Framework for Diffusion Models
Figure 2 for Parasite: A Steganography-based Backdoor Attack Framework for Diffusion Models
Figure 3 for Parasite: A Steganography-based Backdoor Attack Framework for Diffusion Models
Figure 4 for Parasite: A Steganography-based Backdoor Attack Framework for Diffusion Models
Viaarxiv icon

IDInit: A Universal and Stable Initialization Method for Neural Network Training

Add code
Mar 06, 2025
Viaarxiv icon

Gungnir: Exploiting Stylistic Features in Images for Backdoor Attacks on Diffusion Models

Add code
Feb 28, 2025
Viaarxiv icon

Fine-grained Preference Optimization Improves Zero-shot Text-to-Speech

Add code
Feb 05, 2025
Figure 1 for Fine-grained Preference Optimization Improves Zero-shot Text-to-Speech
Figure 2 for Fine-grained Preference Optimization Improves Zero-shot Text-to-Speech
Figure 3 for Fine-grained Preference Optimization Improves Zero-shot Text-to-Speech
Figure 4 for Fine-grained Preference Optimization Improves Zero-shot Text-to-Speech
Viaarxiv icon