Xuyang Wang

Online Speaker Diarization with Graph-based Label Generation

Nov 27, 2021
Via arXiv

Towards Lightweight Applications: Asymmetric Enroll-Verify Structure for Speaker Verification

Oct 09, 2021
Via arXiv

Sparsely Overlapped Speech Training in the Time Domain: Joint Learning of Target Speech Separation and Personal VAD Benefits

Jun 28, 2021
Via arXiv

TransfoRNN: Capturing the Sequential Information in Self-Attention Representations for Language Modeling

Apr 04, 2021
Via arXiv

The 2020 Personalized Voice Trigger Challenge: Open Database, Evaluation Metrics and the Baseline Systems

Jan 06, 2021
Via arXiv

Training Wake Word Detection with Synthesized Speech Data on Confusion Words

Nov 03, 2020
Via arXiv

Mask Detection and Breath Monitoring from Speech: on Data Augmentation, Feature Representation and Modeling

Aug 14, 2020
Via arXiv

Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection

May 24, 2020
Via arXiv

A CNN-Based Blind Denoising Method for Endoscopic Images

Mar 16, 2020
Via arXiv