Yi Yuan

NetEase Fuxi AI Lab

Universal Sound Separation with Self-Supervised Audio Masked Autoencoder

Jul 16, 2024

Improving Audio Generation with Visual Enhanced Caption

Jul 05, 2024

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

Apr 30, 2024

T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining

Apr 27, 2024

HRLAIF: Improvements in Helpfulness and Harmlessness in Open-domain Reinforcement Learning From AI Feedback

Mar 14, 2024

Novel 3D Geometry-Based Stochastic Models for Non-Isotropic MIMO Vehicle-to-Vehicle Channels

Dec 01, 2023

High-Quality 3D Face Reconstruction with Affine Convolutional Networks

Oct 22, 2023

Retrieval-Augmented Text-to-Audio Generation

Sep 14, 2023

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

Aug 10, 2023

Separate Anything You Describe

Aug 09, 2023