Picture for Rongzhi Gu

Rongzhi Gu

Gull: A Generative Multifunctional Audio Codec

Add code
Apr 07, 2024
Figure 1 for Gull: A Generative Multifunctional Audio Codec
Figure 2 for Gull: A Generative Multifunctional Audio Codec
Figure 3 for Gull: A Generative Multifunctional Audio Codec
Figure 4 for Gull: A Generative Multifunctional Audio Codec
Viaarxiv icon

SECap: Speech Emotion Captioning with Large Language Model

Add code
Dec 23, 2023
Figure 1 for SECap: Speech Emotion Captioning with Large Language Model
Figure 2 for SECap: Speech Emotion Captioning with Large Language Model
Figure 3 for SECap: Speech Emotion Captioning with Large Language Model
Figure 4 for SECap: Speech Emotion Captioning with Large Language Model
Viaarxiv icon

ReZero: Region-customizable Sound Extraction

Add code
Aug 31, 2023
Figure 1 for ReZero: Region-customizable Sound Extraction
Figure 2 for ReZero: Region-customizable Sound Extraction
Figure 3 for ReZero: Region-customizable Sound Extraction
Figure 4 for ReZero: Region-customizable Sound Extraction
Viaarxiv icon

Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression

Add code
Aug 21, 2023
Figure 1 for Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression
Figure 2 for Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression
Figure 3 for Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression
Figure 4 for Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression
Viaarxiv icon

The Sound Demixing Challenge 2023 $\unicode{x2013}$ Cinematic Demixing Track

Add code
Aug 14, 2023
Figure 1 for The Sound Demixing Challenge 2023 $\unicode{x2013}$ Cinematic Demixing Track
Figure 2 for The Sound Demixing Challenge 2023 $\unicode{x2013}$ Cinematic Demixing Track
Figure 3 for The Sound Demixing Challenge 2023 $\unicode{x2013}$ Cinematic Demixing Track
Figure 4 for The Sound Demixing Challenge 2023 $\unicode{x2013}$ Cinematic Demixing Track
Viaarxiv icon

Fast Random Approximation of Multi-channel Room Impulse Response

Add code
Apr 17, 2023
Figure 1 for Fast Random Approximation of Multi-channel Room Impulse Response
Figure 2 for Fast Random Approximation of Multi-channel Room Impulse Response
Figure 3 for Fast Random Approximation of Multi-channel Room Impulse Response
Figure 4 for Fast Random Approximation of Multi-channel Room Impulse Response
Viaarxiv icon

3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty

Add code
Feb 27, 2023
Figure 1 for 3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty
Figure 2 for 3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty
Figure 3 for 3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty
Figure 4 for 3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty
Viaarxiv icon

Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation

Dec 24, 2022
Figure 1 for Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Figure 2 for Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Figure 3 for Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Figure 4 for Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Viaarxiv icon

Probing Deep Speaker Embeddings for Speaker-related Tasks

Add code
Dec 14, 2022
Figure 1 for Probing Deep Speaker Embeddings for Speaker-related Tasks
Figure 2 for Probing Deep Speaker Embeddings for Speaker-related Tasks
Figure 3 for Probing Deep Speaker Embeddings for Speaker-related Tasks
Figure 4 for Probing Deep Speaker Embeddings for Speaker-related Tasks
Viaarxiv icon

High Fidelity Speech Enhancement with Band-split RNN

Add code
Dec 01, 2022
Figure 1 for High Fidelity Speech Enhancement with Band-split RNN
Figure 2 for High Fidelity Speech Enhancement with Band-split RNN
Viaarxiv icon