Gal Dalal

Tree Search-Based Policy Optimization under Stochastic Execution Delay

Apr 08, 2024
David Valensi, Esther Derman, Shie Mannor, Gal Dalal

Via arXiv

Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization

Feb 15, 2024
Yihan Du, Anna Winnicki, Gal Dalal, Shie Mannor, R. Srikant

Via arXiv

SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search

Jan 30, 2023
Gal Dalal, Assaf Hallak, Gugan Thoppe, Shie Mannor, Gal Chechik

Via arXiv

SoftTreeMax: Policy Gradient with Tree Search

Sep 28, 2022
Gal Dalal, Assaf Hallak, Shie Mannor, Gal Chechik

Via arXiv

Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs

Jul 05, 2022
Benjamin Fuhrer, Yuval Shpigelman, Chen Tessler, Shie Mannor, Gal Chechik, Eitan Zahavi, Gal Dalal

Via arXiv

Reinforcement Learning with a Terminator

May 30, 2022
Guy Tennenholtz, Nadav Merlis, Lior Shani, Shie Mannor, Uri Shalit, Gal Chechik, Assaf Hallak, Gal Dalal

Via arXiv

Planning and Learning with Adaptive Lookahead

Jan 28, 2022
Aviv Rosenberg, Assaf Hallak, Shie Mannor, Gal Chechik, Gal Dalal

Via arXiv

On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning

Oct 13, 2021
Guy Tennenholtz, Assaf Hallak, Gal Dalal, Shie Mannor, Gal Chechik, Uri Shalit

Via arXiv

Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction

Jul 04, 2021
Assaf Hallak, Gal Dalal, Steven Dalton, Iuri Frosio, Shie Mannor, Gal Chechik

Via arXiv

Reinforcement Learning for Datacenter Congestion Control

Feb 18, 2021
Chen Tessler, Yuval Shpigelman, Gal Dalal, Amit Mandelbaum, Doron Haritan Kazakov, Benjamin Fuhrer, Gal Chechik, Shie Mannor

Via arXiv