Shidong Shang

Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models

Jul 02, 2024

A High Fidelity and Low Complexity Neural Audio Coding

Oct 17, 2023

Inter-SubNet: Speech Enhancement with Subband Interaction

May 09, 2023

TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS Challenge

Mar 14, 2023

Speech Enhancement with Fullband-Subband Cross-Attention Network

Nov 10, 2022

Local-global speaker representation for target speaker extraction

Oct 28, 2022

Speech Enhancement with Intelligent Neural Homomorphic Synthesis

Oct 28, 2022

ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications

Apr 01, 2022

INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing

Apr 02, 2021