Alert button
Picture for Jerry Zhu

Jerry Zhu

Alert button

The Delusional Hedge Algorithm as a Model of Human Learning from Diverse Opinions

Add code
Bookmark button
Alert button
Feb 21, 2024
Yun-Shiuan Chuang, Jerry Zhu, Timothy T. Rogers

Viaarxiv icon

Policy Gradient Bayesian Robust Optimization for Imitation Learning

Add code
Bookmark button
Alert button
Jun 21, 2021
Zaynah Javed, Daniel S. Brown, Satvik Sharma, Jerry Zhu, Ashwin Balakrishna, Marek Petrik, Anca D. Dragan, Ken Goldberg

Figure 1 for Policy Gradient Bayesian Robust Optimization for Imitation Learning
Figure 2 for Policy Gradient Bayesian Robust Optimization for Imitation Learning
Figure 3 for Policy Gradient Bayesian Robust Optimization for Imitation Learning
Figure 4 for Policy Gradient Bayesian Robust Optimization for Imitation Learning
Viaarxiv icon

Corruption-Robust Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 11, 2021
Xuezhou Zhang, Yiding Chen, Jerry Zhu, Wen Sun

Figure 1 for Corruption-Robust Offline Reinforcement Learning
Viaarxiv icon