Alert button
Picture for Xiaofei Li

Xiaofei Li

Alert button

Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers

Add code
Bookmark button
Alert button
Mar 12, 2024
Changsheng Quan, Xiaofei Li

Figure 1 for Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers
Figure 2 for Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers
Figure 3 for Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers
Viaarxiv icon

Mel-FullSubNet: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR

Add code
Bookmark button
Alert button
Feb 22, 2024
Rui Zhou, Xian Li, Ying Fang, Xiaofei Li

Viaarxiv icon

Deep learning and random light structuring ensure robust free-space communications

Add code
Bookmark button
Alert button
Jan 18, 2024
Xiaofei Li, Yu Wang, Xin Liu, Yuan Ma, Yangjian Cai, Sergey A. Ponomarenko, Xianlong Liu

Viaarxiv icon

Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer

Add code
Bookmark button
Alert button
Dec 01, 2023
Bing Yang, Xiaofei Li

Viaarxiv icon

Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors

Add code
Bookmark button
Alert button
Sep 25, 2023
Di Liang, Nian Shao, Xiaofei Li

Viaarxiv icon

RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function

Add code
Bookmark button
Alert button
Sep 15, 2023
Pengyu Wang, Xiaofei Li

Viaarxiv icon

Fine-tune the pretrained ATST model for sound event detection

Add code
Bookmark button
Alert button
Sep 15, 2023
Nian Shao, Xian Li, Xiaofei Li

Figure 1 for Fine-tune the pretrained ATST model for sound event detection
Figure 2 for Fine-tune the pretrained ATST model for sound event detection
Figure 3 for Fine-tune the pretrained ATST model for sound event detection
Figure 4 for Fine-tune the pretrained ATST model for sound event detection
Viaarxiv icon

Unimodal Aggregation for CTC-based Speech Recognition

Add code
Bookmark button
Alert button
Sep 15, 2023
Ying Fang, Xiaofei Li

Figure 1 for Unimodal Aggregation for CTC-based Speech Recognition
Figure 2 for Unimodal Aggregation for CTC-based Speech Recognition
Figure 3 for Unimodal Aggregation for CTC-based Speech Recognition
Figure 4 for Unimodal Aggregation for CTC-based Speech Recognition
Viaarxiv icon

SpatialNet: Extensively Learning Spatial Information for Multichannel Joint Speech Separation, Denoising and Dereverberation

Add code
Bookmark button
Alert button
Jul 31, 2023
Changsheng Quan, Xiaofei Li

Figure 1 for SpatialNet: Extensively Learning Spatial Information for Multichannel Joint Speech Separation, Denoising and Dereverberation
Figure 2 for SpatialNet: Extensively Learning Spatial Information for Multichannel Joint Speech Separation, Denoising and Dereverberation
Figure 3 for SpatialNet: Extensively Learning Spatial Information for Multichannel Joint Speech Separation, Denoising and Dereverberation
Figure 4 for SpatialNet: Extensively Learning Spatial Information for Multichannel Joint Speech Separation, Denoising and Dereverberation
Viaarxiv icon

Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level Tasks

Add code
Bookmark button
Alert button
Jun 07, 2023
Xian Li, Nian Shao, Xiaofei Li

Figure 1 for Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level Tasks
Figure 2 for Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level Tasks
Figure 3 for Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level Tasks
Figure 4 for Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level Tasks
Viaarxiv icon