Picture for Masatoshi Uehara

Masatoshi Uehara

Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review

Add code
Jul 18, 2024
Viaarxiv icon

Adding Conditional Control to Diffusion Models with Reinforcement Learning

Add code
Jun 17, 2024
Figure 1 for Adding Conditional Control to Diffusion Models with Reinforcement Learning
Figure 2 for Adding Conditional Control to Diffusion Models with Reinforcement Learning
Figure 3 for Adding Conditional Control to Diffusion Models with Reinforcement Learning
Figure 4 for Adding Conditional Control to Diffusion Models with Reinforcement Learning
Viaarxiv icon

Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models

Add code
May 31, 2024
Viaarxiv icon

Regularized DeepIV with Model Selection

Add code
Mar 07, 2024
Figure 1 for Regularized DeepIV with Model Selection
Figure 2 for Regularized DeepIV with Model Selection
Figure 3 for Regularized DeepIV with Model Selection
Figure 4 for Regularized DeepIV with Model Selection
Viaarxiv icon

Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control

Add code
Feb 28, 2024
Figure 1 for Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control
Figure 2 for Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control
Figure 3 for Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control
Figure 4 for Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control
Viaarxiv icon

Feedback Efficient Online Fine-Tuning of Diffusion Models

Add code
Feb 27, 2024
Figure 1 for Feedback Efficient Online Fine-Tuning of Diffusion Models
Figure 2 for Feedback Efficient Online Fine-Tuning of Diffusion Models
Figure 3 for Feedback Efficient Online Fine-Tuning of Diffusion Models
Figure 4 for Feedback Efficient Online Fine-Tuning of Diffusion Models
Viaarxiv icon

Functional Graphical Models: Structure Enables Offline Data-Driven Optimization

Add code
Jan 12, 2024
Figure 1 for Functional Graphical Models: Structure Enables Offline Data-Driven Optimization
Figure 2 for Functional Graphical Models: Structure Enables Offline Data-Driven Optimization
Figure 3 for Functional Graphical Models: Structure Enables Offline Data-Driven Optimization
Figure 4 for Functional Graphical Models: Structure Enables Offline Data-Driven Optimization
Viaarxiv icon

Source Condition Double Robust Inference on Functionals of Inverse Problems

Add code
Jul 25, 2023
Figure 1 for Source Condition Double Robust Inference on Functionals of Inverse Problems
Figure 2 for Source Condition Double Robust Inference on Functionals of Inverse Problems
Viaarxiv icon

Off-Policy Evaluation of Ranking Policies under Diverse User Behavior

Add code
Jun 26, 2023
Figure 1 for Off-Policy Evaluation of Ranking Policies under Diverse User Behavior
Figure 2 for Off-Policy Evaluation of Ranking Policies under Diverse User Behavior
Figure 3 for Off-Policy Evaluation of Ranking Policies under Diverse User Behavior
Figure 4 for Off-Policy Evaluation of Ranking Policies under Diverse User Behavior
Viaarxiv icon

How to Query Human Feedback Efficiently in RL?

Add code
May 29, 2023
Viaarxiv icon