Alert button
Picture for Xin Xu

Xin Xu

Alert button

WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition

Add code
Bookmark button
Alert button
Oct 12, 2021
Binbin Zhang, Hang Lv, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie, Xin Xu, Hui Bu, Xiaoyu Chen, Chenchen Zeng, Di Wu, Zhendong Peng

Figure 1 for WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Figure 2 for WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Figure 3 for WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Figure 4 for WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Viaarxiv icon

Learning Practically Feasible Policies for Online 3D Bin Packing

Add code
Bookmark button
Alert button
Sep 07, 2021
Hang Zhao, Chenyang Zhu, Xin Xu, Hui Huang, Kai Xu

Figure 1 for Learning Practically Feasible Policies for Online 3D Bin Packing
Figure 2 for Learning Practically Feasible Policies for Online 3D Bin Packing
Figure 3 for Learning Practically Feasible Policies for Online 3D Bin Packing
Figure 4 for Learning Practically Feasible Policies for Online 3D Bin Packing
Viaarxiv icon

Unsupervised Video Person Re-identification via Noise and Hard frame Aware Clustering

Add code
Bookmark button
Alert button
Jun 10, 2021
Pengyu Xie, Xin Xu, Zheng Wang, Toshihiko Yamasaki

Figure 1 for Unsupervised Video Person Re-identification via Noise and Hard frame Aware Clustering
Figure 2 for Unsupervised Video Person Re-identification via Noise and Hard frame Aware Clustering
Figure 3 for Unsupervised Video Person Re-identification via Noise and Hard frame Aware Clustering
Figure 4 for Unsupervised Video Person Re-identification via Noise and Hard frame Aware Clustering
Viaarxiv icon

A LiDAR Assisted Control Module with High Precision in Parking Scenarios for Autonomous Driving Vehicle

Add code
Bookmark button
Alert button
May 02, 2021
Xin Xu, Yu Dong, Fan Zhu

Figure 1 for A LiDAR Assisted Control Module with High Precision in Parking Scenarios for Autonomous Driving Vehicle
Figure 2 for A LiDAR Assisted Control Module with High Precision in Parking Scenarios for Autonomous Driving Vehicle
Figure 3 for A LiDAR Assisted Control Module with High Precision in Parking Scenarios for Autonomous Driving Vehicle
Figure 4 for A LiDAR Assisted Control Module with High Precision in Parking Scenarios for Autonomous Driving Vehicle
Viaarxiv icon

Multi-objective Feature Selection with Missing Data in Classification

Add code
Bookmark button
Alert button
Apr 18, 2021
Yu Xue, Yihang Tang, Xin Xu, Jiayu Liang, Ferrante Neri

Figure 1 for Multi-objective Feature Selection with Missing Data in Classification
Figure 2 for Multi-objective Feature Selection with Missing Data in Classification
Figure 3 for Multi-objective Feature Selection with Missing Data in Classification
Figure 4 for Multi-objective Feature Selection with Missing Data in Classification
Viaarxiv icon

AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario

Add code
Bookmark button
Alert button
Apr 08, 2021
Yihui Fu, Luyao Cheng, Shubo Lv, Yukai Jv, Yuxiang Kong, Zhuo Chen, Yanxin Hu, Lei Xie, Jian Wu, Hui Bu, Xin Xu, Jun Du, Jingdong Chen

Figure 1 for AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario
Figure 2 for AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario
Figure 3 for AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario
Figure 4 for AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario
Viaarxiv icon

The Multi-speaker Multi-style Voice Cloning Challenge 2021

Add code
Bookmark button
Alert button
Apr 05, 2021
Qicong Xie, Xiaohai Tian, Guanghou Liu, Kun Song, Lei Xie, Zhiyong Wu, Hai Li, Song Shi, Haizhou Li, Fen Hong, Hui Bu, Xin Xu

Figure 1 for The Multi-speaker Multi-style Voice Cloning Challenge 2021
Figure 2 for The Multi-speaker Multi-style Voice Cloning Challenge 2021
Viaarxiv icon

INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing

Add code
Bookmark button
Alert button
Apr 02, 2021
Wei Rao, Yihui Fu, Yanxin Hu, Xin Xu, Yvkai Jv, Jiangyu Han, Zhongjie Jiang, Lei Xie, Yannan Wang, Shinji Watanabe, Zheng-Hua Tan, Hui Bu, Tao Yu, Shidong Shang

Figure 1 for INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing
Figure 2 for INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing
Figure 3 for INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing
Viaarxiv icon

CKNet: A Convolutional Neural Network Based on Koopman Operator for Modeling Latent Dynamics from Pixels

Add code
Bookmark button
Alert button
Feb 19, 2021
Yongqian Xiao, Xin Xu, QianLi Lin

Figure 1 for CKNet: A Convolutional Neural Network Based on Koopman Operator for Modeling Latent Dynamics from Pixels
Figure 2 for CKNet: A Convolutional Neural Network Based on Koopman Operator for Modeling Latent Dynamics from Pixels
Figure 3 for CKNet: A Convolutional Neural Network Based on Koopman Operator for Modeling Latent Dynamics from Pixels
Figure 4 for CKNet: A Convolutional Neural Network Based on Koopman Operator for Modeling Latent Dynamics from Pixels
Viaarxiv icon