Alert button
Picture for Shiliang Zhang

Shiliang Zhang

Alert button

BAT: Boundary aware transducer for memory-efficient and low-latency ASR

Add code
Bookmark button
Alert button
May 19, 2023
Keyu An, Xian Shi, Shiliang Zhang

Figure 1 for BAT: Boundary aware transducer for memory-efficient and low-latency ASR
Figure 2 for BAT: Boundary aware transducer for memory-efficient and low-latency ASR
Figure 3 for BAT: Boundary aware transducer for memory-efficient and low-latency ASR
Figure 4 for BAT: Boundary aware transducer for memory-efficient and low-latency ASR
Viaarxiv icon

FunASR: A Fundamental End-to-End Speech Recognition Toolkit

Add code
Bookmark button
Alert button
May 18, 2023
Zhifu Gao, Zerui Li, Jiaming Wang, Haoneng Luo, Xian Shi, Mengzhe Chen, Yabin Li, Lingyun Zuo, Zhihao Du, Zhangyu Xiao, Shiliang Zhang

Figure 1 for FunASR: A Fundamental End-to-End Speech Recognition Toolkit
Figure 2 for FunASR: A Fundamental End-to-End Speech Recognition Toolkit
Figure 3 for FunASR: A Fundamental End-to-End Speech Recognition Toolkit
Figure 4 for FunASR: A Fundamental End-to-End Speech Recognition Toolkit
Viaarxiv icon

Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System

Add code
Bookmark button
Alert button
May 18, 2023
Xian Shi, Haoneng Luo, Zhifu Gao, Shiliang Zhang, Zhijie Yan

Figure 1 for Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System
Figure 2 for Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System
Figure 3 for Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System
Figure 4 for Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System
Viaarxiv icon

TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization

Add code
Bookmark button
Alert button
Mar 08, 2023
Jiaming Wang, Zhihao Du, Shiliang Zhang

Figure 1 for TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization
Figure 2 for TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization
Figure 3 for TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization
Figure 4 for TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization
Viaarxiv icon

Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model

Add code
Bookmark button
Alert button
Jan 29, 2023
Xian Shi, Yanni Chen, Shiliang Zhang, Zhijie Yan

Figure 1 for Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model
Figure 2 for Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model
Figure 3 for Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model
Figure 4 for Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model
Viaarxiv icon

MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition

Add code
Bookmark button
Alert button
Nov 29, 2022
Xiaohuan Zhou, Jiaming Wang, Zeyu Cui, Shiliang Zhang, Zhijie Yan, Jingren Zhou, Chang Zhou

Figure 1 for MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition
Figure 2 for MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition
Figure 3 for MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition
Figure 4 for MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition
Viaarxiv icon

Deep Active Learning for Computer Vision: Past and Future

Add code
Bookmark button
Alert button
Nov 27, 2022
Rinyoichi Takezoe, Xu Liu, Shunan Mao, Marco Tianyu Chen, Zhanpeng Feng, Shiliang Zhang, Xiaoyu Wang

Figure 1 for Deep Active Learning for Computer Vision: Past and Future
Figure 2 for Deep Active Learning for Computer Vision: Past and Future
Figure 3 for Deep Active Learning for Computer Vision: Past and Future
Figure 4 for Deep Active Learning for Computer Vision: Past and Future
Viaarxiv icon

Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis

Add code
Bookmark button
Alert button
Nov 18, 2022
Zhihao Du, Shiliang Zhang, Siqi Zheng, Zhijie Yan

Figure 1 for Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis
Figure 2 for Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis
Figure 3 for Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis
Figure 4 for Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis
Viaarxiv icon

ParCNetV2: Oversized Kernel with Enhanced Attention

Add code
Bookmark button
Alert button
Nov 14, 2022
Ruihan Xu, Haokui Zhang, Wenze Hu, Shiliang Zhang, Xiaoyu Wang

Figure 1 for ParCNetV2: Oversized Kernel with Enhanced Attention
Figure 2 for ParCNetV2: Oversized Kernel with Enhanced Attention
Figure 3 for ParCNetV2: Oversized Kernel with Enhanced Attention
Figure 4 for ParCNetV2: Oversized Kernel with Enhanced Attention
Viaarxiv icon

A Comparative Study on multichannel Speaker-attributed automatic speech recognition in Multi-party Meetings

Add code
Bookmark button
Alert button
Nov 01, 2022
Mohan Shi, Jie Zhang, Zhihao Du, Fan Yu, Shiliang Zhang, Li-Rong Dai

Figure 1 for A Comparative Study on multichannel Speaker-attributed automatic speech recognition in Multi-party Meetings
Figure 2 for A Comparative Study on multichannel Speaker-attributed automatic speech recognition in Multi-party Meetings
Figure 3 for A Comparative Study on multichannel Speaker-attributed automatic speech recognition in Multi-party Meetings
Figure 4 for A Comparative Study on multichannel Speaker-attributed automatic speech recognition in Multi-party Meetings
Viaarxiv icon