Zhaoqing Li

Towards Data-free and Training-free Compression for Speech Foundation Models Using Parameter Clustering

Jun 11, 2026
Via arXiv

UNISON: A Unified Sound Generation and Editing Framework via Deep LLM Fusion

May 29, 2026
Via arXiv

Beyond Text Prompts: Visual-to-Visual Generation as A Unified Paradigm

May 12, 2026
Via arXiv

Flow2GAN: Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-step High-Fidelity Audio Generation

Dec 29, 2025
Via arXiv

ZipVoice: Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching

Jun 16, 2025
Via arXiv

Effective and Efficient One-pass Compression of Speech Foundation Models Using Sparsity-aware Self-pinching Gates

May 28, 2025
Via arXiv

Towards One-bit ASR: Extremely Low-bit Conformer Quantization Using Co-training and Stochastic Precision

May 27, 2025
Via arXiv

Unfolding A Few Structures for The Many: Memory-Efficient Compression of Conformer and Speech Foundation Models

May 27, 2025
Via arXiv

Automated Heterogeneous Network learning with Non-Recursive Message Passing

Jan 10, 2025
Via arXiv

Phone-purity Guided Discrete Tokens for Dysarthric Speech Recognition

Jan 08, 2025
Via arXiv