Alert button
Picture for Xilin Jiang

Xilin Jiang

Alert button

Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation

Add code
Bookmark button
Alert button
Mar 27, 2024
Xilin Jiang, Cong Han, Nima Mesgarani

Viaarxiv icon

Listen, Chat, and Edit: Text-Guided Soundscape Modification for Enhanced Auditory Experience

Add code
Bookmark button
Alert button
Feb 06, 2024
Xilin Jiang, Cong Han, Yinghao Aaron Li, Nima Mesgarani

Viaarxiv icon

Exploring Self-Supervised Contrastive Learning of Spatial Sound Event Representation

Add code
Bookmark button
Alert button
Sep 27, 2023
Xilin Jiang, Cong Han, Yinghao Aaron Li, Nima Mesgarani

Viaarxiv icon

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

Add code
Bookmark button
Alert button
Sep 18, 2023
Yinghao Aaron Li, Cong Han, Xilin Jiang, Nima Mesgarani

Figure 1 for HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
Figure 2 for HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
Figure 3 for HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
Viaarxiv icon

DeCoR: Defy Knowledge Forgetting by Predicting Earlier Audio Codes

Add code
Bookmark button
Alert button
May 29, 2023
Xilin Jiang, Yinghao Aaron Li, Nima Mesgarani

Figure 1 for DeCoR: Defy Knowledge Forgetting by Predicting Earlier Audio Codes
Figure 2 for DeCoR: Defy Knowledge Forgetting by Predicting Earlier Audio Codes
Figure 3 for DeCoR: Defy Knowledge Forgetting by Predicting Earlier Audio Codes
Figure 4 for DeCoR: Defy Knowledge Forgetting by Predicting Earlier Audio Codes
Viaarxiv icon

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Add code
Bookmark button
Alert button
Jan 20, 2023
Yinghao Aaron Li, Cong Han, Xilin Jiang, Nima Mesgarani

Figure 1 for Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
Figure 2 for Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
Figure 3 for Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
Figure 4 for Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
Viaarxiv icon

Learning Representations for New Sound Classes With Continual Self-Supervised Learning

Add code
Bookmark button
Alert button
May 15, 2022
Zhepei Wang, Cem Subakan, Xilin Jiang, Junkai Wu, Efthymios Tzinis, Mirco Ravanelli, Paris Smaragdis

Figure 1 for Learning Representations for New Sound Classes With Continual Self-Supervised Learning
Figure 2 for Learning Representations for New Sound Classes With Continual Self-Supervised Learning
Figure 3 for Learning Representations for New Sound Classes With Continual Self-Supervised Learning
Figure 4 for Learning Representations for New Sound Classes With Continual Self-Supervised Learning
Viaarxiv icon

Compute and memory efficient universal sound source separation

Add code
Bookmark button
Alert button
Mar 03, 2021
Efthymios Tzinis, Zhepei Wang, Xilin Jiang, Paris Smaragdis

Figure 1 for Compute and memory efficient universal sound source separation
Figure 2 for Compute and memory efficient universal sound source separation
Figure 3 for Compute and memory efficient universal sound source separation
Figure 4 for Compute and memory efficient universal sound source separation
Viaarxiv icon