Picture for Xiulian Peng

Xiulian Peng

Masked Audio Modeling with CLAP and Multi-Objective Learning

Add code
Jan 29, 2024
Viaarxiv icon

Low-latency Speech Enhancement via Speech Token Generation

Add code
Oct 20, 2023
Figure 1 for Low-latency Speech Enhancement via Speech Token Generation
Figure 2 for Low-latency Speech Enhancement via Speech Token Generation
Figure 3 for Low-latency Speech Enhancement via Speech Token Generation
Figure 4 for Low-latency Speech Enhancement via Speech Token Generation
Viaarxiv icon

Rethinking Audiovisual Segmentation with Semantic Quantization and Decomposition

Add code
Sep 29, 2023
Figure 1 for Rethinking Audiovisual Segmentation with Semantic Quantization and Decomposition
Figure 2 for Rethinking Audiovisual Segmentation with Semantic Quantization and Decomposition
Figure 3 for Rethinking Audiovisual Segmentation with Semantic Quantization and Decomposition
Figure 4 for Rethinking Audiovisual Segmentation with Semantic Quantization and Decomposition
Viaarxiv icon

ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression

Add code
May 26, 2023
Figure 1 for ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression
Figure 2 for ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression
Figure 3 for ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression
Figure 4 for ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression
Viaarxiv icon

DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation

Add code
Mar 14, 2023
Figure 1 for DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Figure 2 for DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Figure 3 for DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Figure 4 for DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Viaarxiv icon

Contrast-PLC: Contrastive Learning for Packet Loss Concealment

Add code
Feb 26, 2023
Figure 1 for Contrast-PLC: Contrastive Learning for Packet Loss Concealment
Figure 2 for Contrast-PLC: Contrastive Learning for Packet Loss Concealment
Figure 3 for Contrast-PLC: Contrastive Learning for Packet Loss Concealment
Figure 4 for Contrast-PLC: Contrastive Learning for Packet Loss Concealment
Viaarxiv icon

Time-Variance Aware Real-Time Speech Enhancement

Add code
Feb 25, 2023
Figure 1 for Time-Variance Aware Real-Time Speech Enhancement
Figure 2 for Time-Variance Aware Real-Time Speech Enhancement
Figure 3 for Time-Variance Aware Real-Time Speech Enhancement
Figure 4 for Time-Variance Aware Real-Time Speech Enhancement
Viaarxiv icon

Improving Speech Enhancement via Event-based Query

Add code
Feb 24, 2023
Figure 1 for Improving Speech Enhancement via Event-based Query
Figure 2 for Improving Speech Enhancement via Event-based Query
Figure 3 for Improving Speech Enhancement via Event-based Query
Figure 4 for Improving Speech Enhancement via Event-based Query
Viaarxiv icon

Real-time speech enhancement with dynamic attention span

Add code
Feb 21, 2023
Figure 1 for Real-time speech enhancement with dynamic attention span
Figure 2 for Real-time speech enhancement with dynamic attention span
Figure 3 for Real-time speech enhancement with dynamic attention span
Figure 4 for Real-time speech enhancement with dynamic attention span
Viaarxiv icon

Disentangled Feature Learning for Real-Time Neural Speech Coding

Add code
Nov 22, 2022
Figure 1 for Disentangled Feature Learning for Real-Time Neural Speech Coding
Figure 2 for Disentangled Feature Learning for Real-Time Neural Speech Coding
Figure 3 for Disentangled Feature Learning for Real-Time Neural Speech Coding
Figure 4 for Disentangled Feature Learning for Real-Time Neural Speech Coding
Viaarxiv icon