Edwin Zhang

A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health

Feb 23, 2024
Nikhil Behari, Edwin Zhang, Yunfan Zhao, Aparna Taneja, Dheeraj Nagaraj, Milind Tambe

Via arXiv

Social Environment Design

Feb 21, 2024
Edwin Zhang, Sadie Zhao, Tonghan Wang, Safwan Hossain, Henry Gasztowtt, Stephan Zheng, David C. Parkes, Milind Tambe, Yiling Chen

Via arXiv

Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping

Dec 18, 2023
Lauren H. Cooke, Harvey Klyne, Edwin Zhang, Cassidy Laidlaw, Milind Tambe, Finale Doshi-Velez

Via arXiv

Towards Zero Shot Learning in Restless Multi-armed Bandits

Oct 23, 2023
Yunfan Zhao, Nikhil Behari, Edward Hughes, Edwin Zhang, Dheeraj Nagaraj, Karl Tuyls, Aparna Taneja, Milind Tambe

Via arXiv

Offline Reinforcement Learning with Closed-Form Policy Improvement Operators

Nov 29, 2022
Jiachen Li, Edwin Zhang, Ming Yin, Qinxun Bai, Yu-Xiang Wang, William Yang Wang

Via arXiv

LAD: Language Augmented Diffusion for Reinforcement Learning

Oct 27, 2022
Edwin Zhang, Yujie Lu, William Wang, Amy Zhang

Via arXiv

Covidex: Neural Ranking Models and Keyword Search Infrastructure for the COVID-19 Open Research Dataset

Jul 14, 2020
Edwin Zhang, Nikhil Gupta, Raphael Tang, Xiao Han, Ronak Pradeep, Kuang Lu, Yue Zhang, Rodrigo Nogueira, Kyunghyun Cho, Hui Fang, Jimmy Lin

Via arXiv

Rapidly Bootstrapping a Question Answering Dataset for COVID-19

Apr 23, 2020
Raphael Tang, Rodrigo Nogueira, Edwin Zhang, Nikhil Gupta, Phuong Cam, Kyunghyun Cho, Jimmy Lin

Via arXiv

Rapidly Deploying a Neural Search Engine for the COVID-19 Open Research Dataset: Preliminary Thoughts and Lessons Learned

Apr 10, 2020
Edwin Zhang, Nikhil Gupta, Rodrigo Nogueira, Kyunghyun Cho, Jimmy Lin

Via arXiv