Longbiao Wang

MIMO-DBnet: Multi-channel Input and Multiple Outputs DOA-aware Beamforming Network for Speech Separation

Dec 07, 2022

The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results

Nov 03, 2022

I4U System Description for NIST SRE'20 CTS Challenge

Nov 02, 2022

Monolingual Recognizers Fusion for Code-switching Speech Recognition

Nov 02, 2022

Deep Spectro-temporal Artifacts for Detecting Synthesized Speech

Oct 11, 2022

VCSE: Time-Domain Visual-Contextual Speaker Extraction Network

Oct 09, 2022

MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources

Jul 15, 2022

Language-specific Characteristic Assistance for Code-switching Speech Recognition

Jul 05, 2022

Iterative Sound Source Localization for Unknown Number of Sources

Jun 24, 2022

Talking Head Generation Driven by Speech-Related Facial Action Units and Audio- Based on Multimodal Representation Fusion

Apr 27, 2022