Alert button
Picture for Masatoshi Uehara

Masatoshi Uehara

Alert button

Regularized DeepIV with Model Selection

Add code
Bookmark button
Alert button
Mar 07, 2024
Zihao Li, Hui Lan, Vasilis Syrgkanis, Mengdi Wang, Masatoshi Uehara

Figure 1 for Regularized DeepIV with Model Selection
Figure 2 for Regularized DeepIV with Model Selection
Figure 3 for Regularized DeepIV with Model Selection
Figure 4 for Regularized DeepIV with Model Selection
Viaarxiv icon

Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control

Add code
Bookmark button
Alert button
Feb 28, 2024
Masatoshi Uehara, Yulai Zhao, Kevin Black, Ehsan Hajiramezanali, Gabriele Scalia, Nathaniel Lee Diamant, Alex M Tseng, Tommaso Biancalani, Sergey Levine

Viaarxiv icon

Feedback Efficient Online Fine-Tuning of Diffusion Models

Add code
Bookmark button
Alert button
Feb 27, 2024
Masatoshi Uehara, Yulai Zhao, Kevin Black, Ehsan Hajiramezanali, Gabriele Scalia, Nathaniel Lee Diamant, Alex M Tseng, Sergey Levine, Tommaso Biancalani

Viaarxiv icon

Functional Graphical Models: Structure Enables Offline Data-Driven Optimization

Add code
Bookmark button
Alert button
Jan 12, 2024
Jakub Grudzien Kuba, Masatoshi Uehara, Pieter Abbeel, Sergey Levine

Viaarxiv icon

Source Condition Double Robust Inference on Functionals of Inverse Problems

Add code
Bookmark button
Alert button
Jul 25, 2023
Andrew Bennett, Nathan Kallus, Xiaojie Mao, Whitney Newey, Vasilis Syrgkanis, Masatoshi Uehara

Figure 1 for Source Condition Double Robust Inference on Functionals of Inverse Problems
Figure 2 for Source Condition Double Robust Inference on Functionals of Inverse Problems
Viaarxiv icon

Off-Policy Evaluation of Ranking Policies under Diverse User Behavior

Add code
Bookmark button
Alert button
Jun 26, 2023
Haruka Kiyohara, Masatoshi Uehara, Yusuke Narita, Nobuyuki Shimizu, Yasuo Yamamoto, Yuta Saito

Figure 1 for Off-Policy Evaluation of Ranking Policies under Diverse User Behavior
Figure 2 for Off-Policy Evaluation of Ranking Policies under Diverse User Behavior
Figure 3 for Off-Policy Evaluation of Ranking Policies under Diverse User Behavior
Figure 4 for Off-Policy Evaluation of Ranking Policies under Diverse User Behavior
Viaarxiv icon

How to Query Human Feedback Efficiently in RL?

Add code
Bookmark button
Alert button
May 29, 2023
Wenhao Zhan, Masatoshi Uehara, Wen Sun, Jason D. Lee

Viaarxiv icon

Provable Offline Reinforcement Learning with Human Feedback

Add code
Bookmark button
Alert button
May 24, 2023
Wenhao Zhan, Masatoshi Uehara, Nathan Kallus, Jason D. Lee, Wen Sun

Viaarxiv icon