Cheng Yu

Using fine-tuning and min lookahead beam search to improve Whisper

Sep 19, 2023
Andrea Do, Oscar Brown, Zhengjie Wang, Nikhil Mathew, Zixin Liu, Jawwad Ahmed, Cheng Yu

Cross-Utterance Conditioned VAE for Speech Generation

Sep 08, 2023
Yang Li, Cheng Yu, Guangzhi Sun, Weiqin Zu, Zheng Tian, Ying Wen, Wei Pan, Chao Zhang, Jun Wang, Yang Yang, Fanglei Sun

FaceChain: A Playground for Identity-Preserving Portrait Generation

Aug 28, 2023
Yang Liu, Cheng Yu, Lei Shang, Ziheng Wu, Xingjun Wang, Yuze Zhao, Lin Zhu, Chen Cheng, Weitao Chen, Chao Xu, Haoyu Xie, Yuan Yao, Wenmeng Zhou, Yingda Chen, Xuansong Xie, Baigui Sun

Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech

May 09, 2022
Yang Li, Cheng Yu, Guangzhi Sun, Hua Jiang, Fanglei Sun, Weiqin Zu, Ying Wen, Yang Yang, Jun Wang

Perceptual Contrast Stretching on Target Feature for Speech Enhancement

Apr 01, 2022
Rong Chao, Cheng Yu, Szu-Wei Fu, Xugang Lu, Yu Tsao

Conditional Diffusion Probabilistic Model for Speech Enhancement

Feb 10, 2022
Yen-Ju Lu, Zhong-Qiu Wang, Shinji Watanabe, Alexander Richard, Cheng Yu, Yu Tsao

OSSEM: one-shot speaker adaptive speech enhancement using meta learning

Nov 10, 2021
Cheng Yu, Szu-Wei Fu, Tsun-An Hsieh, Yu Tsao, Mirco Ravanelli

HASA-net: A non-intrusive hearing-aid speech assessment network

Nov 10, 2021
Hsin-Tien Chiang, Yi-Chiao Wu, Cheng Yu, Tomoki Toda, Hsin-Min Wang, Yih-Chun Hu, Yu Tsao

SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points

Nov 08, 2021
Yu-Chen Lin, Cheng Yu, Yi-Te Hsu, Szu-Wei Fu, Yu Tsao, Tei-Wei Kuo

MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech

Oct 12, 2021
Szu-Wei Fu, Cheng Yu, Kuo-Hsuan Hung, Mirco Ravanelli, Yu Tsao
