Wei Zou

Analyzing Robustness of End-to-End Neural Models for Automatic Speech Recognition

Aug 17, 2022
Goutham Rajendran, Wei Zou

DSLA: Dynamic smooth label assignment for efficient anchor-free object detection

Aug 01, 2022
Hu Su, Yonghao He, Jiabin Zhang, Wei Zou, Bin Fan

Audio-Visual Wake Word Spotting System For MISP Challenge 2021

Apr 20, 2022
Yanguang Xu, Jianwei Sun, Yang Han, Shuaijiang Zhao, Chaoyang Mei, Tingwei Guo, Shuran Zhou, Chuandong Xie, Wei Zou, Xiangang Li

Time Domain Adversarial Voice Conversion for ADD 2022

Apr 20, 2022
Cheng Wen, Tingwei Guo, Xingjun Tan, Rui Yan, Shuran Zhou, Chuandong Xie, Wei Zou, Xiangang Li

Audio Deep Fake Detection System with Neural Stitching for ADD 2022

Apr 20, 2022
Rui Yan, Cheng Wen, Shuran Zhou, Tingwei Guo, Wei Zou, Xiangang Li

GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio

Jun 13, 2021
Guoguo Chen, Shuzhou Chai, Guanbo Wang, Jiayu Du, Wei-Qiang Zhang, Chao Weng, Dan Su, Daniel Povey, Jan Trmal, Junbo Zhang, Mingjie Jin, Sanjeev Khudanpur, Shinji Watanabe, Shuaijiang Zhao, Wei Zou, Xiangang Li, Xuchen Yao, Yongqing Wang, Yujun Wang, Zhao You, Zhiyong Yan

Semantic Data Augmentation for End-to-End Mandarin Speech Recognition

Apr 26, 2021
Jianwei Sun, Zhiyuan Tang, Hengxin Yin, Wei Wang, Xi Zhao, Shuaijiang Zhao, Xiaoning Lei, Wei Zou, Xiangang Li

Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning

Oct 27, 2020
Dongwei Jiang, Wubo Li, Miao Cao, Ruixiong Zhang, Wei Zou, Kun Han, Xiangang Li

TMT: A Transformer-based Modal Translator for Improving Multimodal Sequence Representations in Audio Visual Scene-aware Dialog

Oct 21, 2020
Wubo Li, Dongwei Jiang, Wei Zou, Xiangang Li

A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition

Jun 23, 2020
Dongwei Jiang, Wubo Li, Ruixiong Zhang, Miao Cao, Ne Luo, Yang Han, Wei Zou, Xiangang Li
