Alert button
Picture for Shiliang Zhang

Shiliang Zhang

Alert button

MFCCA:Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario

Add code
Bookmark button
Alert button
Oct 11, 2022
Fan Yu, Shiliang Zhang, Pengcheng Guo, Yuhao Liang, Zhihao Du, Yuxiao Lin, Lei Xie

Figure 1 for MFCCA:Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario
Figure 2 for MFCCA:Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario
Figure 3 for MFCCA:Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario
Figure 4 for MFCCA:Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario
Viaarxiv icon

ALBench: A Framework for Evaluating Active Learning in Object Detection

Add code
Bookmark button
Alert button
Aug 10, 2022
Zhanpeng Feng, Shiliang Zhang, Rinyoichi Takezoe, Wenze Hu, Manmohan Chandraker, Li-Jia Li, Vijay K. Narayanan, Xiaoyu Wang

Figure 1 for ALBench: A Framework for Evaluating Active Learning in Object Detection
Figure 2 for ALBench: A Framework for Evaluating Active Learning in Object Detection
Figure 3 for ALBench: A Framework for Evaluating Active Learning in Object Detection
Figure 4 for ALBench: A Framework for Evaluating Active Learning in Object Detection
Viaarxiv icon

Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Jun 20, 2022
Zhifu Gao, Shiliang Zhang, Ian McLoughlin, Zhijie Yan

Figure 1 for Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition
Figure 2 for Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition
Figure 3 for Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition
Figure 4 for Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition
Viaarxiv icon

A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings

Add code
Bookmark button
Alert button
Apr 01, 2022
Fan Yu, Zhihao Du, Shiliang Zhang, Yuxiao Lin, Lei Xie

Figure 1 for A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings
Figure 2 for A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings
Figure 3 for A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings
Figure 4 for A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings
Viaarxiv icon

Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios

Add code
Bookmark button
Alert button
Mar 31, 2022
Zhihao Du, Shiliang Zhang, Siqi Zheng, Zhijie Yan

Figure 1 for Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios
Figure 2 for Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios
Figure 3 for Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios
Figure 4 for Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios
Viaarxiv icon

Extended vehicle energy dataset (eVED): an enhanced large-scale dataset for deep learning on vehicle trip energy consumption

Add code
Bookmark button
Alert button
Mar 16, 2022
Shiliang Zhang, Dyako Fatih, Fahmi Abdulqadir, Tobias Schwarz, Xuehui Ma

Figure 1 for Extended vehicle energy dataset (eVED): an enhanced large-scale dataset for deep learning on vehicle trip energy consumption
Figure 2 for Extended vehicle energy dataset (eVED): an enhanced large-scale dataset for deep learning on vehicle trip energy consumption
Figure 3 for Extended vehicle energy dataset (eVED): an enhanced large-scale dataset for deep learning on vehicle trip energy consumption
Figure 4 for Extended vehicle energy dataset (eVED): an enhanced large-scale dataset for deep learning on vehicle trip energy consumption
Viaarxiv icon

Contextualize differential privacy in image database: a lightweight image differential privacy approach based on principle component analysis inverse

Add code
Bookmark button
Alert button
Feb 19, 2022
Shiliang Zhang, Xuehui Ma, Hui Cao, Tengyuan Zhao, Yajie Yu, Zhuzhu Wang

Figure 1 for Contextualize differential privacy in image database: a lightweight image differential privacy approach based on principle component analysis inverse
Figure 2 for Contextualize differential privacy in image database: a lightweight image differential privacy approach based on principle component analysis inverse
Figure 3 for Contextualize differential privacy in image database: a lightweight image differential privacy approach based on principle component analysis inverse
Figure 4 for Contextualize differential privacy in image database: a lightweight image differential privacy approach based on principle component analysis inverse
Viaarxiv icon

ProsoSpeech: Enhancing Prosody With Quantized Vector Pre-training in Text-to-Speech

Add code
Bookmark button
Alert button
Feb 16, 2022
Yi Ren, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen, Zhijie Yan, Zhou Zhao

Figure 1 for ProsoSpeech: Enhancing Prosody With Quantized Vector Pre-training in Text-to-Speech
Figure 2 for ProsoSpeech: Enhancing Prosody With Quantized Vector Pre-training in Text-to-Speech
Figure 3 for ProsoSpeech: Enhancing Prosody With Quantized Vector Pre-training in Text-to-Speech
Viaarxiv icon