Jianwu Dang

Deep Spectro-temporal Artifacts for Detecting Synthesized Speech

Oct 11, 2022

VCSE: Time-Domain Visual-Contextual Speaker Extraction Network

Oct 09, 2022

MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources

Jul 15, 2022

Language-specific Characteristic Assistance for Code-switching Speech Recognition

Jul 05, 2022

Iterative Sound Source Localization for Unknown Number of Sources

Jun 24, 2022

Heterogeneous Graph Neural Networks using Self-supervised Reciprocally Contrastive Learning

Apr 30, 2022

TMS: A Temporal Multi-scale Backbone Design for Speaker Embedding

Mar 17, 2022

L-SpEx: Localized Target Speaker Extraction

Feb 21, 2022

Using multiple reference audios and style embedding constraints for speech synthesis

Oct 09, 2021

Exploring Deep Learning for Joint Audio-Visual Lip Biometrics

Apr 17, 2021