Rongzhi Gu

Gull: A Generative Multifunctional Audio Codec

Apr 07, 2024
Yi Luo, Jianwei Yu, Hangting Chen, Rongzhi Gu, Chao Weng

Via arXiv

SECap: Speech Emotion Captioning with Large Language Model

Dec 23, 2023
Yaoxun Xu, Hangting Chen, Jianwei Yu, Qiaochu Huang, Zhiyong Wu, Shixiong Zhang, Guangzhi Li, Yi Luo, Rongzhi Gu

Via arXiv

ReZero: Region-customizable Sound Extraction

Aug 31, 2023
Rongzhi Gu, Yi Luo

Via arXiv

Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression

Aug 21, 2023
Hangting Chen, Jianwei Yu, Yi Luo, Rongzhi Gu, Weihua Li, Zhuocheng Lu, Chao Weng

Via arXiv

The Sound Demixing Challenge 2023 – Cinematic Demixing Track

Aug 14, 2023
Stefan Uhlich, Giorgio Fabbro, Masato Hirano, Shusuke Takahashi, Gordon Wichern, Jonathan Le Roux, Dipam Chakraborty, Sharada Mohanty, Kai Li, Yi Luo, Jianwei Yu, Rongzhi Gu, Roman Solovyev, Alexander Stempkovskiy, Tatiana Habruseva, Mikhail Sukhovei, Yuki Mitsufuji

Via arXiv

Fast Random Approximation of Multi-channel Room Impulse Response

Apr 17, 2023
Yi Luo, Rongzhi Gu

Via arXiv

3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty

Feb 27, 2023
Rongzhi Gu, Shi-Xiong Zhang, Dong Yu

Via arXiv

Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation

Dec 24, 2022
Rongzhi Gu, Shi-Xiong Zhang, Yuexian Zou, Dong Yu

Via arXiv

Probing Deep Speaker Embeddings for Speaker-related Tasks

Dec 14, 2022
Zifeng Zhao, Ding Pan, Junyi Peng, Rongzhi Gu

Via arXiv