Alert button
Picture for Yash Chandak

Yash Chandak

Alert button

A/B testing under Interference with Partial Network Information

Add code
Bookmark button
Alert button
Apr 16, 2024
Shiv Shankar, Ritwik Sinha, Yash Chandak, Saayan Mitra, Madalina Fiterau

Viaarxiv icon

Adaptive Instrument Design for Indirect Experiments

Add code
Bookmark button
Alert button
Dec 05, 2023
Yash Chandak, Shiv Shankar, Vasilis Syrgkanis, Emma Brunskill

Viaarxiv icon

Behavior Alignment via Reward Function Optimization

Add code
Bookmark button
Alert button
Oct 31, 2023
Dhawal Gupta, Yash Chandak, Scott M. Jordan, Philip S. Thomas, Bruno Castro da Silva

Viaarxiv icon

Supervised Pretraining Can Learn In-Context Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 26, 2023
Jonathan N. Lee, Annie Xie, Aldo Pacchiano, Yash Chandak, Chelsea Finn, Ofir Nachum, Emma Brunskill

Figure 1 for Supervised Pretraining Can Learn In-Context Reinforcement Learning
Figure 2 for Supervised Pretraining Can Learn In-Context Reinforcement Learning
Figure 3 for Supervised Pretraining Can Learn In-Context Reinforcement Learning
Figure 4 for Supervised Pretraining Can Learn In-Context Reinforcement Learning
Viaarxiv icon

Coagent Networks: Generalized and Scaled

Add code
Bookmark button
Alert button
May 16, 2023
James E. Kostas, Scott M. Jordan, Yash Chandak, Georgios Theocharous, Dhawal Gupta, Martha White, Bruno Castro da Silva, Philip S. Thomas

Figure 1 for Coagent Networks: Generalized and Scaled
Figure 2 for Coagent Networks: Generalized and Scaled
Figure 3 for Coagent Networks: Generalized and Scaled
Figure 4 for Coagent Networks: Generalized and Scaled
Viaarxiv icon

Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition

Add code
Bookmark button
Alert button
May 02, 2023
Yash Chandak, Shantanu Thakoor, Zhaohan Daniel Guo, Yunhao Tang, Remi Munos, Will Dabney, Diana L Borsa

Figure 1 for Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
Figure 2 for Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
Figure 3 for Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
Figure 4 for Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
Viaarxiv icon

Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments

Add code
Bookmark button
Alert button
Feb 23, 2023
Vincent Liu, Yash Chandak, Philip Thomas, Martha White

Figure 1 for Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
Figure 2 for Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
Figure 3 for Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
Figure 4 for Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
Viaarxiv icon

Optimization using Parallel Gradient Evaluations on Multiple Parameters

Add code
Bookmark button
Alert button
Feb 06, 2023
Yash Chandak, Shiv Shankar, Venkata Gandikota, Philip S. Thomas, Arya Mazumdar

Figure 1 for Optimization using Parallel Gradient Evaluations on Multiple Parameters
Figure 2 for Optimization using Parallel Gradient Evaluations on Multiple Parameters
Figure 3 for Optimization using Parallel Gradient Evaluations on Multiple Parameters
Viaarxiv icon

Off-Policy Evaluation for Action-Dependent Non-Stationary Environments

Add code
Bookmark button
Alert button
Jan 24, 2023
Yash Chandak, Shiv Shankar, Nathaniel D. Bastian, Bruno Castro da Silva, Emma Brunskil, Philip S. Thomas

Figure 1 for Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Figure 2 for Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Figure 3 for Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Figure 4 for Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Viaarxiv icon