Picture for Zhaoheng Ni

Zhaoheng Ni

High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching

Add code
Jul 04, 2024
Viaarxiv icon

URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement

Add code
Jun 07, 2024
Viaarxiv icon

An Empirical Study on the Impact of Positional Encoding in Transformer-based Monaural Speech Enhancement

Add code
Jan 18, 2024
Figure 1 for An Empirical Study on the Impact of Positional Encoding in Transformer-based Monaural Speech Enhancement
Figure 2 for An Empirical Study on the Impact of Positional Encoding in Transformer-based Monaural Speech Enhancement
Figure 3 for An Empirical Study on the Impact of Positional Encoding in Transformer-based Monaural Speech Enhancement
Figure 4 for An Empirical Study on the Impact of Positional Encoding in Transformer-based Monaural Speech Enhancement
Viaarxiv icon

On The Open Prompt Challenge In Conditional Audio Generation

Add code
Nov 01, 2023
Figure 1 for On The Open Prompt Challenge In Conditional Audio Generation
Figure 2 for On The Open Prompt Challenge In Conditional Audio Generation
Figure 3 for On The Open Prompt Challenge In Conditional Audio Generation
Figure 4 for On The Open Prompt Challenge In Conditional Audio Generation
Viaarxiv icon

TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch

Add code
Oct 27, 2023
Figure 1 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 2 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 3 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 4 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Viaarxiv icon

Exploring Speech Enhancement for Low-resource Speech Synthesis

Add code
Sep 19, 2023
Figure 1 for Exploring Speech Enhancement for Low-resource Speech Synthesis
Figure 2 for Exploring Speech Enhancement for Low-resource Speech Synthesis
Figure 3 for Exploring Speech Enhancement for Low-resource Speech Synthesis
Figure 4 for Exploring Speech Enhancement for Low-resource Speech Synthesis
Viaarxiv icon

FoleyGen: Visually-Guided Audio Generation

Add code
Sep 19, 2023
Figure 1 for FoleyGen: Visually-Guided Audio Generation
Figure 2 for FoleyGen: Visually-Guided Audio Generation
Figure 3 for FoleyGen: Visually-Guided Audio Generation
Figure 4 for FoleyGen: Visually-Guided Audio Generation
Viaarxiv icon

Stack-and-Delay: a new codebook pattern for music generation

Add code
Sep 15, 2023
Figure 1 for Stack-and-Delay: a new codebook pattern for music generation
Figure 2 for Stack-and-Delay: a new codebook pattern for music generation
Figure 3 for Stack-and-Delay: a new codebook pattern for music generation
Figure 4 for Stack-and-Delay: a new codebook pattern for music generation
Viaarxiv icon

Enhance audio generation controllability through representation similarity regularization

Add code
Sep 15, 2023
Figure 1 for Enhance audio generation controllability through representation similarity regularization
Figure 2 for Enhance audio generation controllability through representation similarity regularization
Figure 3 for Enhance audio generation controllability through representation similarity regularization
Figure 4 for Enhance audio generation controllability through representation similarity regularization
Viaarxiv icon

Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute

Add code
Jun 11, 2023
Figure 1 for Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute
Figure 2 for Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute
Figure 3 for Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute
Figure 4 for Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute
Viaarxiv icon