Picture for Shiliang Zhang

Shiliang Zhang

BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR

Add code
May 23, 2023
Figure 1 for BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR
Figure 2 for BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR
Figure 3 for BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR
Figure 4 for BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR
Viaarxiv icon

Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction

Add code
May 21, 2023
Figure 1 for Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction
Figure 2 for Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction
Figure 3 for Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction
Figure 4 for Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction
Viaarxiv icon

CASA-ASR: Context-Aware Speaker-Attributed ASR

Add code
May 21, 2023
Figure 1 for CASA-ASR: Context-Aware Speaker-Attributed ASR
Figure 2 for CASA-ASR: Context-Aware Speaker-Attributed ASR
Figure 3 for CASA-ASR: Context-Aware Speaker-Attributed ASR
Figure 4 for CASA-ASR: Context-Aware Speaker-Attributed ASR
Viaarxiv icon

BAT: Boundary aware transducer for memory-efficient and low-latency ASR

Add code
May 19, 2023
Figure 1 for BAT: Boundary aware transducer for memory-efficient and low-latency ASR
Figure 2 for BAT: Boundary aware transducer for memory-efficient and low-latency ASR
Figure 3 for BAT: Boundary aware transducer for memory-efficient and low-latency ASR
Figure 4 for BAT: Boundary aware transducer for memory-efficient and low-latency ASR
Viaarxiv icon

FunASR: A Fundamental End-to-End Speech Recognition Toolkit

Add code
May 18, 2023
Figure 1 for FunASR: A Fundamental End-to-End Speech Recognition Toolkit
Figure 2 for FunASR: A Fundamental End-to-End Speech Recognition Toolkit
Figure 3 for FunASR: A Fundamental End-to-End Speech Recognition Toolkit
Figure 4 for FunASR: A Fundamental End-to-End Speech Recognition Toolkit
Viaarxiv icon

TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization

Add code
Mar 08, 2023
Figure 1 for TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization
Figure 2 for TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization
Figure 3 for TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization
Figure 4 for TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization
Viaarxiv icon

Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model

Add code
Jan 29, 2023
Figure 1 for Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model
Figure 2 for Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model
Figure 3 for Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model
Figure 4 for Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model
Viaarxiv icon

MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition

Add code
Nov 29, 2022
Figure 1 for MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition
Figure 2 for MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition
Figure 3 for MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition
Figure 4 for MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition
Viaarxiv icon

Deep Active Learning for Computer Vision: Past and Future

Add code
Nov 27, 2022
Figure 1 for Deep Active Learning for Computer Vision: Past and Future
Figure 2 for Deep Active Learning for Computer Vision: Past and Future
Figure 3 for Deep Active Learning for Computer Vision: Past and Future
Figure 4 for Deep Active Learning for Computer Vision: Past and Future
Viaarxiv icon

Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis

Add code
Nov 18, 2022
Viaarxiv icon