Yi Yuan

NetEase Fuxi AI Lab

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

Apr 30, 2024
Via arXiv

T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining

Apr 27, 2024
Via arXiv

HRLAIF: Improvements in Helpfulness and Harmlessness in Open-domain Reinforcement Learning From AI Feedback

Mar 14, 2024
Via arXiv

Novel 3D Geometry-Based Stochastic Models for Non-Isotropic MIMO Vehicle-to-Vehicle Channels

Dec 01, 2023
Via arXiv

High-Quality 3D Face Reconstruction with Affine Convolutional Networks

Oct 22, 2023
Via arXiv

Retrieval-Augmented Text-to-Audio Generation

Sep 14, 2023
Via arXiv

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

Aug 10, 2023
Via arXiv

Separate Anything You Describe

Aug 09, 2023
Via arXiv

WavJourney: Compositional Audio Creation with Large Language Models

Jul 26, 2023
Via arXiv

Text-Driven Foley Sound Generation With Latent Diffusion Model

Jun 23, 2023
Via arXiv