Jianwu Dang

A Refining Underlying Information Framework for Monaural Speech Enhancement

Dec 24, 2023
Rui Cao, Tianrui Wang, Meng Ge, Longbiao Wang, Jianwu Dang

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

Dec 22, 2023
Cheng Gong, Xin Wang, Erica Cooper, Dan Wells, Longbiao Wang, Jianwu Dang, Korin Richmond, Junichi Yamagishi

Ahpatron: A New Budgeted Online Kernel Learning Machine with Tighter Mistake Bound

Dec 12, 2023
Yun Liao, Junfan Li, Shizhong Liao, Qinghua Hu, Jianwu Dang

High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models

Sep 27, 2023
Chunyu Qiang, Hao Li, Yixin Tian, Yi Zhao, Ying Zhang, Longbiao Wang, Jianwu Dang

Learning Speech Representation From Contrastive Token-Acoustic Pretraining

Sep 06, 2023
Chunyu Qiang, Hao Li, Yixin Tian, Ruibo Fu, Tao Wang, Longbiao Wang, Jianwu Dang

Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding

Jul 28, 2023
Chunyu Qiang, Hao Li, Hao Ni, He Qu, Ruibo Fu, Tao Wang, Longbiao Wang, Jianwu Dang

Rethinking the visual cues in audio-visual speaker extraction

Jun 05, 2023
Junjie Li, Meng Ge, Zexu Pan, Rui Cao, Longbiao Wang, Jianwu Dang, Shiliang Zhang

Speech and noise dual-stream spectrogram refine network with speech distortion loss for robust speech recognition

May 30, 2023
Haoyu Lu, Nan Li, Tongtong Song, Longbiao Wang, Jianwu Dang, Xiaobao Wang, Shiliang Zhang

Locate and Beamform: Two-dimensional Locating All-neural Beamformer for Multi-channel Speech Separation

May 18, 2023
Yanjie Fu, Meng Ge, Honglong Wang, Nan Li, Haoran Yin, Longbiao Wang, Gaoyan Zhang, Jianwu Dang, Chengyun Deng, Fei Wang

Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder

Mar 26, 2023
Hao Shi, Masato Mimura, Longbiao Wang, Jianwu Dang, Tatsuya Kawahara
