Picture for Gal Dalal

Gal Dalal

Reliable Critics: Monotonic Improvement and Convergence Guarantees for Reinforcement Learning

Add code
Jun 08, 2025
Viaarxiv icon

Reinforcement Learning with Segment Feedback

Add code
Feb 03, 2025
Figure 1 for Reinforcement Learning with Segment Feedback
Figure 2 for Reinforcement Learning with Segment Feedback
Figure 3 for Reinforcement Learning with Segment Feedback
Figure 4 for Reinforcement Learning with Segment Feedback
Viaarxiv icon

Gradient Boosting Reinforcement Learning

Add code
Jul 11, 2024
Viaarxiv icon

PlaMo: Plan and Move in Rich 3D Physical Environments

Add code
Jun 26, 2024
Viaarxiv icon

Tree Search-Based Policy Optimization under Stochastic Execution Delay

Add code
Apr 08, 2024
Viaarxiv icon

Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization

Add code
Feb 15, 2024
Figure 1 for Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization
Figure 2 for Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization
Figure 3 for Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization
Viaarxiv icon

SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search

Add code
Jan 30, 2023
Figure 1 for SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search
Figure 2 for SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search
Figure 3 for SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search
Figure 4 for SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search
Viaarxiv icon

SoftTreeMax: Policy Gradient with Tree Search

Add code
Sep 28, 2022
Figure 1 for SoftTreeMax: Policy Gradient with Tree Search
Figure 2 for SoftTreeMax: Policy Gradient with Tree Search
Figure 3 for SoftTreeMax: Policy Gradient with Tree Search
Viaarxiv icon

Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs

Add code
Jul 05, 2022
Figure 1 for Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
Figure 2 for Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
Figure 3 for Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
Figure 4 for Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
Viaarxiv icon

Reinforcement Learning with a Terminator

Add code
May 30, 2022
Figure 1 for Reinforcement Learning with a Terminator
Figure 2 for Reinforcement Learning with a Terminator
Figure 3 for Reinforcement Learning with a Terminator
Figure 4 for Reinforcement Learning with a Terminator
Viaarxiv icon