Picture for Pengcheng Guo

Pengcheng Guo

Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets

Add code
May 06, 2024
Figure 1 for Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets
Figure 2 for Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets
Figure 3 for Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets
Figure 4 for Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets
Viaarxiv icon

Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder

Add code
Apr 08, 2024
Figure 1 for Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder
Figure 2 for Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder
Figure 3 for Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder
Figure 4 for Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder
Viaarxiv icon

An audio-quality-based multi-strategy approach for target speaker extraction in the MISP 2023 Challenge

Add code
Jan 08, 2024
Viaarxiv icon

The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023

Add code
Jan 07, 2024
Figure 1 for The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023
Figure 2 for The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023
Viaarxiv icon

MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition

Add code
Jan 07, 2024
Viaarxiv icon

ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

Add code
Jan 07, 2024
Figure 1 for ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge
Figure 2 for ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge
Viaarxiv icon

Automatic channel selection and spatial feature integration for multi-channel speech recognition across various array topologies

Add code
Dec 15, 2023
Viaarxiv icon

Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition

Add code
Nov 17, 2023
Figure 1 for Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition
Figure 2 for Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition
Figure 3 for Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition
Figure 4 for Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition
Viaarxiv icon

SA-Paraformer: Non-autoregressive End-to-End Speaker-Attributed ASR

Add code
Oct 07, 2023
Figure 1 for SA-Paraformer: Non-autoregressive End-to-End Speaker-Attributed ASR
Figure 2 for SA-Paraformer: Non-autoregressive End-to-End Speaker-Attributed ASR
Figure 3 for SA-Paraformer: Non-autoregressive End-to-End Speaker-Attributed ASR
Figure 4 for SA-Paraformer: Non-autoregressive End-to-End Speaker-Attributed ASR
Viaarxiv icon

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study

Add code
Sep 27, 2023
Figure 1 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Figure 2 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Figure 3 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Figure 4 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Viaarxiv icon