Alert button
Picture for Longbiao Wang

Longbiao Wang

Alert button

Language-specific Characteristic Assistance for Code-switching Speech Recognition

Add code
Bookmark button
Alert button
Jul 05, 2022
Tongtong Song, Qiang Xu, Meng Ge, Longbiao Wang, Hao Shi, Yongjie Lv, Yuqin Lin, Jianwu Dang

Figure 1 for Language-specific Characteristic Assistance for Code-switching Speech Recognition
Figure 2 for Language-specific Characteristic Assistance for Code-switching Speech Recognition
Figure 3 for Language-specific Characteristic Assistance for Code-switching Speech Recognition
Figure 4 for Language-specific Characteristic Assistance for Code-switching Speech Recognition
Viaarxiv icon

Iterative Sound Source Localization for Unknown Number of Sources

Add code
Bookmark button
Alert button
Jun 24, 2022
Yanjie Fu, Meng Ge, Haoran Yin, Xinyuan Qian, Longbiao Wang, Gaoyan Zhang, Jianwu Dang

Figure 1 for Iterative Sound Source Localization for Unknown Number of Sources
Figure 2 for Iterative Sound Source Localization for Unknown Number of Sources
Figure 3 for Iterative Sound Source Localization for Unknown Number of Sources
Figure 4 for Iterative Sound Source Localization for Unknown Number of Sources
Viaarxiv icon

Talking Head Generation Driven by Speech-Related Facial Action Units and Audio- Based on Multimodal Representation Fusion

Add code
Bookmark button
Alert button
Apr 27, 2022
Sen Chen, Zhilei Liu, Jiaxing Liu, Longbiao Wang

Figure 1 for Talking Head Generation Driven by Speech-Related Facial Action Units and Audio- Based on Multimodal Representation Fusion
Figure 2 for Talking Head Generation Driven by Speech-Related Facial Action Units and Audio- Based on Multimodal Representation Fusion
Figure 3 for Talking Head Generation Driven by Speech-Related Facial Action Units and Audio- Based on Multimodal Representation Fusion
Figure 4 for Talking Head Generation Driven by Speech-Related Facial Action Units and Audio- Based on Multimodal Representation Fusion
Viaarxiv icon

L-SpEx: Localized Target Speaker Extraction

Add code
Bookmark button
Alert button
Feb 21, 2022
Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li

Figure 1 for L-SpEx: Localized Target Speaker Extraction
Figure 2 for L-SpEx: Localized Target Speaker Extraction
Figure 3 for L-SpEx: Localized Target Speaker Extraction
Figure 4 for L-SpEx: Localized Target Speaker Extraction
Viaarxiv icon

Talking Head Generation with Audio and Speech Related Facial Action Units

Add code
Bookmark button
Alert button
Oct 19, 2021
Sen Chen, Zhilei Liu, Jiaxing Liu, Zhengxiang Yan, Longbiao Wang

Figure 1 for Talking Head Generation with Audio and Speech Related Facial Action Units
Figure 2 for Talking Head Generation with Audio and Speech Related Facial Action Units
Figure 3 for Talking Head Generation with Audio and Speech Related Facial Action Units
Figure 4 for Talking Head Generation with Audio and Speech Related Facial Action Units
Viaarxiv icon

Using multiple reference audios and style embedding constraints for speech synthesis

Add code
Bookmark button
Alert button
Oct 09, 2021
Cheng Gong, Longbiao Wang, Zhenhua Ling, Ju Zhang, Jianwu Dang

Figure 1 for Using multiple reference audios and style embedding constraints for speech synthesis
Figure 2 for Using multiple reference audios and style embedding constraints for speech synthesis
Figure 3 for Using multiple reference audios and style embedding constraints for speech synthesis
Figure 4 for Using multiple reference audios and style embedding constraints for speech synthesis
Viaarxiv icon

Information Sieve: Content Leakage Reduction in End-to-End Prosody For Expressive Speech Synthesis

Add code
Bookmark button
Alert button
Aug 04, 2021
Xudong Dai, Cheng Gong, Longbiao Wang, Kaili Zhang

Figure 1 for Information Sieve: Content Leakage Reduction in End-to-End Prosody For Expressive Speech Synthesis
Figure 2 for Information Sieve: Content Leakage Reduction in End-to-End Prosody For Expressive Speech Synthesis
Figure 3 for Information Sieve: Content Leakage Reduction in End-to-End Prosody For Expressive Speech Synthesis
Figure 4 for Information Sieve: Content Leakage Reduction in End-to-End Prosody For Expressive Speech Synthesis
Viaarxiv icon

Exploring Deep Learning for Joint Audio-Visual Lip Biometrics

Add code
Bookmark button
Alert button
Apr 17, 2021
Meng Liu, Longbiao Wang, Kong Aik Lee, Hanyi Zhang, Chang Zeng, Jianwu Dang

Figure 1 for Exploring Deep Learning for Joint Audio-Visual Lip Biometrics
Figure 2 for Exploring Deep Learning for Joint Audio-Visual Lip Biometrics
Figure 3 for Exploring Deep Learning for Joint Audio-Visual Lip Biometrics
Figure 4 for Exploring Deep Learning for Joint Audio-Visual Lip Biometrics
Viaarxiv icon

Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals

Add code
Bookmark button
Alert button
Nov 19, 2020
Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li

Figure 1 for Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals
Figure 2 for Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals
Figure 3 for Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals
Figure 4 for Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals
Viaarxiv icon