Yulai Zhao

Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review

Jul 18, 2024
Via arXiv

Adding Conditional Control to Diffusion Models with Reinforcement Learning

Jun 17, 2024
Via arXiv

Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models

May 31, 2024
Via arXiv

Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control

Feb 28, 2024
Via arXiv

Feedback Efficient Online Fine-Tuning of Diffusion Models

Feb 27, 2024
Via arXiv

Provably Efficient CVaR RL in Low-rank MDPs

Nov 20, 2023
Via arXiv

Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning

May 08, 2023
Via arXiv

Blessing of Class Diversity in Pre-training

Sep 12, 2022
Via arXiv

Optimizing the Performative Risk under Weak Convexity Assumptions

Sep 12, 2022
Via arXiv

Provably Efficient Policy Gradient Methods for Two-Player Zero-Sum Markov Games

Feb 17, 2021
Via arXiv