Pengcheng Guo

Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition

Jun 01, 2023
Tianyi Xu, Zhanheng Yang, Kaixun Huang, Pengcheng Guo, Ao Zhang, Biao Li, Changru Chen, Chao Li, Lei Xie

Pseudo-Siamese Network based Timbre-reserved Black-box Adversarial Attack in Speaker Identification

May 30, 2023
Qing Wang, Jixun Yao, Ziqian Wang, Pengcheng Guo, Lei Xie

BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR

May 23, 2023
Yuhao Liang, Fan Yu, Yangze Li, Pengcheng Guo, Shiliang Zhang, Qian Chen, Lei Xie

TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition

May 23, 2023
Hongfei Xue, Qijie Shao, Peikun Chen, Pengcheng Guo, Lei Xie, Jie Liu

Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network

May 21, 2023
Kaixun Huang, Ao Zhang, Zhanheng Yang, Pengcheng Guo, Bingshen Mu, Tianyi Xu, Lei Xie

VE-KWS: Visual Modality Enhanced End-to-End Keyword Spotting

Mar 14, 2023
Ao Zhang, He Wang, Pengcheng Guo, Yihui Fu, Lei Xie, Yingying Gao, Shilei Zhang, Junlan Feng

The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge

Mar 11, 2023
Pengcheng Guo, He Wang, Bingshen Mu, Ao Zhang, Peikun Chen

TESSP: Text-Enhanced Self-Supervised Speech Pre-training

Nov 24, 2022
Zhuoyuan Yao, Shuo Ren, Sanyuan Chen, Ziyang Ma, Pengcheng Guo, Lei Xie

Distinguishable Speaker Anonymization based on Formant and Fundamental Frequency Scaling

Nov 06, 2022
Jixun Yao, Qing Wang, Yi Lei, Pengcheng Guo, Lei Xie, Namin Wang, Jie Liu
