Alert button
Picture for Nadav Merlis

Nadav Merlis

Alert button

The Value of Reward Lookahead in Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 18, 2024
Nadav Merlis, Dorian Baudry, Vianney Perchet

Figure 1 for The Value of Reward Lookahead in Reinforcement Learning
Figure 2 for The Value of Reward Lookahead in Reinforcement Learning
Figure 3 for The Value of Reward Lookahead in Reinforcement Learning
Viaarxiv icon

Reinforcement Learning with History-Dependent Dynamic Contexts

Add code
Bookmark button
Alert button
Feb 04, 2023
Guy Tennenholtz, Nadav Merlis, Lior Shani, Martin Mladenov, Craig Boutilier

Figure 1 for Reinforcement Learning with History-Dependent Dynamic Contexts
Figure 2 for Reinforcement Learning with History-Dependent Dynamic Contexts
Viaarxiv icon

Reinforcement Learning with a Terminator

Add code
Bookmark button
Alert button
May 30, 2022
Guy Tennenholtz, Nadav Merlis, Lior Shani, Shie Mannor, Uri Shalit, Gal Chechik, Assaf Hallak, Gal Dalal

Figure 1 for Reinforcement Learning with a Terminator
Figure 2 for Reinforcement Learning with a Terminator
Figure 3 for Reinforcement Learning with a Terminator
Figure 4 for Reinforcement Learning with a Terminator
Viaarxiv icon

Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits

Add code
Bookmark button
Alert button
Oct 12, 2021
Nadav Merlis, Yonathan Efroni, Shie Mannor

Figure 1 for Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits
Figure 2 for Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits
Figure 3 for Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits
Figure 4 for Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits
Viaarxiv icon

Ensemble Bootstrapping for Q-Learning

Add code
Bookmark button
Alert button
Feb 28, 2021
Oren Peer, Chen Tessler, Nadav Merlis, Ron Meir

Figure 1 for Ensemble Bootstrapping for Q-Learning
Figure 2 for Ensemble Bootstrapping for Q-Learning
Figure 3 for Ensemble Bootstrapping for Q-Learning
Figure 4 for Ensemble Bootstrapping for Q-Learning
Viaarxiv icon

Confidence-Budget Matching for Sequential Budgeted Learning

Add code
Bookmark button
Alert button
Feb 05, 2021
Yonathan Efroni, Nadav Merlis, Aadirupa Saha, Shie Mannor

Viaarxiv icon

Lenient Regret for Multi-Armed Bandits

Add code
Bookmark button
Alert button
Sep 13, 2020
Nadav Merlis, Shie Mannor

Figure 1 for Lenient Regret for Multi-Armed Bandits
Figure 2 for Lenient Regret for Multi-Armed Bandits
Figure 3 for Lenient Regret for Multi-Armed Bandits
Viaarxiv icon

Reinforcement Learning with Trajectory Feedback

Add code
Bookmark button
Alert button
Aug 13, 2020
Yonathan Efroni, Nadav Merlis, Shie Mannor

Figure 1 for Reinforcement Learning with Trajectory Feedback
Viaarxiv icon

Tight Lower Bounds for Combinatorial Multi-Armed Bandits

Add code
Bookmark button
Alert button
Feb 13, 2020
Nadav Merlis, Shie Mannor

Figure 1 for Tight Lower Bounds for Combinatorial Multi-Armed Bandits
Figure 2 for Tight Lower Bounds for Combinatorial Multi-Armed Bandits
Figure 3 for Tight Lower Bounds for Combinatorial Multi-Armed Bandits
Viaarxiv icon