Picture for Cheng Yu

Cheng Yu

Using fine-tuning and min lookahead beam search to improve Whisper

Add code
Sep 19, 2023
Figure 1 for Using fine-tuning and min lookahead beam search to improve Whisper
Figure 2 for Using fine-tuning and min lookahead beam search to improve Whisper
Figure 3 for Using fine-tuning and min lookahead beam search to improve Whisper
Figure 4 for Using fine-tuning and min lookahead beam search to improve Whisper
Viaarxiv icon

Cross-Utterance Conditioned VAE for Speech Generation

Add code
Sep 08, 2023
Figure 1 for Cross-Utterance Conditioned VAE for Speech Generation
Figure 2 for Cross-Utterance Conditioned VAE for Speech Generation
Figure 3 for Cross-Utterance Conditioned VAE for Speech Generation
Figure 4 for Cross-Utterance Conditioned VAE for Speech Generation
Viaarxiv icon

FaceChain: A Playground for Identity-Preserving Portrait Generation

Add code
Aug 28, 2023
Figure 1 for FaceChain: A Playground for Identity-Preserving Portrait Generation
Figure 2 for FaceChain: A Playground for Identity-Preserving Portrait Generation
Figure 3 for FaceChain: A Playground for Identity-Preserving Portrait Generation
Figure 4 for FaceChain: A Playground for Identity-Preserving Portrait Generation
Viaarxiv icon

Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech

Add code
May 09, 2022
Figure 1 for Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech
Figure 2 for Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech
Figure 3 for Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech
Figure 4 for Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech
Viaarxiv icon

Perceptual Contrast Stretching on Target Feature for Speech Enhancement

Add code
Apr 01, 2022
Figure 1 for Perceptual Contrast Stretching on Target Feature for Speech Enhancement
Figure 2 for Perceptual Contrast Stretching on Target Feature for Speech Enhancement
Figure 3 for Perceptual Contrast Stretching on Target Feature for Speech Enhancement
Figure 4 for Perceptual Contrast Stretching on Target Feature for Speech Enhancement
Viaarxiv icon

Conditional Diffusion Probabilistic Model for Speech Enhancement

Add code
Feb 10, 2022
Figure 1 for Conditional Diffusion Probabilistic Model for Speech Enhancement
Figure 2 for Conditional Diffusion Probabilistic Model for Speech Enhancement
Figure 3 for Conditional Diffusion Probabilistic Model for Speech Enhancement
Figure 4 for Conditional Diffusion Probabilistic Model for Speech Enhancement
Viaarxiv icon

OSSEM: one-shot speaker adaptive speech enhancement using meta learning

Add code
Nov 10, 2021
Figure 1 for OSSEM: one-shot speaker adaptive speech enhancement using meta learning
Figure 2 for OSSEM: one-shot speaker adaptive speech enhancement using meta learning
Figure 3 for OSSEM: one-shot speaker adaptive speech enhancement using meta learning
Figure 4 for OSSEM: one-shot speaker adaptive speech enhancement using meta learning
Viaarxiv icon

HASA-net: A non-intrusive hearing-aid speech assessment network

Add code
Nov 10, 2021
Figure 1 for HASA-net: A non-intrusive hearing-aid speech assessment network
Figure 2 for HASA-net: A non-intrusive hearing-aid speech assessment network
Figure 3 for HASA-net: A non-intrusive hearing-aid speech assessment network
Figure 4 for HASA-net: A non-intrusive hearing-aid speech assessment network
Viaarxiv icon

SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points

Add code
Nov 08, 2021
Figure 1 for SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points
Figure 2 for SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points
Figure 3 for SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points
Figure 4 for SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points
Viaarxiv icon

MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech

Add code
Oct 12, 2021
Figure 1 for MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech
Figure 2 for MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech
Figure 3 for MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech
Figure 4 for MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech
Viaarxiv icon