Alert button
Picture for Longbiao Wang

Longbiao Wang

Alert button

ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

Add code
Bookmark button
Alert button
Jan 07, 2024
He Wang, Pengcheng Guo, Yue Li, Ao Zhang, Jiayao Sun, Lei Xie, Wei Chen, Pan Zhou, Hui Bu, Xin Xu, Binbin Zhang, Zhuo Chen, Jian Wu, Longbiao Wang, Eng Siong Chng, Sun Li

Viaarxiv icon

A Refining Underlying Information Framework for Monaural Speech Enhancement

Add code
Bookmark button
Alert button
Dec 24, 2023
Rui Cao, Tianrui Wang, Meng Ge, Longbiao Wang, Jianwu Dang

Viaarxiv icon

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

Add code
Bookmark button
Alert button
Dec 22, 2023
Cheng Gong, Xin Wang, Erica Cooper, Dan Wells, Longbiao Wang, Jianwu Dang, Korin Richmond, Junichi Yamagishi

Viaarxiv icon

Multi-Level Knowledge Distillation for Speech Emotion Recognition in Noisy Conditions

Add code
Bookmark button
Alert button
Dec 21, 2023
Yang Liu, Haoqin Sun, Geng Chen, Qingyue Wang, Zhen Zhao, Xugang Lu, Longbiao Wang

Viaarxiv icon

High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models

Add code
Bookmark button
Alert button
Sep 27, 2023
Chunyu Qiang, Hao Li, Yixin Tian, Yi Zhao, Ying Zhang, Longbiao Wang, Jianwu Dang

Figure 1 for High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models
Figure 2 for High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models
Figure 3 for High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models
Viaarxiv icon

Learning Speech Representation From Contrastive Token-Acoustic Pretraining

Add code
Bookmark button
Alert button
Sep 06, 2023
Chunyu Qiang, Hao Li, Yixin Tian, Ruibo Fu, Tao Wang, Longbiao Wang, Jianwu Dang

Figure 1 for Learning Speech Representation From Contrastive Token-Acoustic Pretraining
Figure 2 for Learning Speech Representation From Contrastive Token-Acoustic Pretraining
Viaarxiv icon

CPSP: Learning Speech Concepts From Phoneme Supervision

Add code
Bookmark button
Alert button
Sep 01, 2023
Chunyu Qiang, Hao Li, Yixin Tian, Ruibo Fu, Tao Wang, Longbiao Wang, Jianwu Dang

Figure 1 for CPSP: Learning Speech Concepts From Phoneme Supervision
Figure 2 for CPSP: Learning Speech Concepts From Phoneme Supervision
Viaarxiv icon

Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding

Add code
Bookmark button
Alert button
Jul 28, 2023
Chunyu Qiang, Hao Li, Hao Ni, He Qu, Ruibo Fu, Tao Wang, Longbiao Wang, Jianwu Dang

Figure 1 for Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding
Figure 2 for Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding
Figure 3 for Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding
Figure 4 for Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding
Viaarxiv icon

Rethinking the visual cues in audio-visual speaker extraction

Add code
Bookmark button
Alert button
Jun 05, 2023
Junjie Li, Meng Ge, Zexu pan, Rui Cao, Longbiao Wang, Jianwu Dang, Shiliang Zhang

Figure 1 for Rethinking the visual cues in audio-visual speaker extraction
Figure 2 for Rethinking the visual cues in audio-visual speaker extraction
Figure 3 for Rethinking the visual cues in audio-visual speaker extraction
Viaarxiv icon

speech and noise dual-stream spectrogram refine network with speech distortion loss for robust speech recognition

Add code
Bookmark button
Alert button
May 30, 2023
Haoyu Lu, Nan Li, Tongtong Song, Longbiao Wang, Jianwu Dang, Xiaobao Wang, Shiliang Zhang

Figure 1 for speech and noise dual-stream spectrogram refine network with speech distortion loss for robust speech recognition
Figure 2 for speech and noise dual-stream spectrogram refine network with speech distortion loss for robust speech recognition
Figure 3 for speech and noise dual-stream spectrogram refine network with speech distortion loss for robust speech recognition
Figure 4 for speech and noise dual-stream spectrogram refine network with speech distortion loss for robust speech recognition
Viaarxiv icon