Alert button
Picture for Shie Mannor

Shie Mannor

Alert button

Tree Search-Based Policy Optimization under Stochastic Execution Delay

Add code
Bookmark button
Alert button
Apr 08, 2024
David Valensi, Esther Derman, Shie Mannor, Gal Dalal

Viaarxiv icon

On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes

Add code
Bookmark button
Alert button
Mar 11, 2024
Navdeep Kumar, Yashaswini Murthy, Itai Shufaro, Kfir Y. Levy, R. Srikant, Shie Mannor

Figure 1 for On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes
Figure 2 for On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes
Figure 3 for On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes
Figure 4 for On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes
Viaarxiv icon

Conservative DDPG -- Pessimistic RL without Ensemble

Add code
Bookmark button
Alert button
Mar 08, 2024
Nitsan Soffair, Shie Mannor

Figure 1 for Conservative DDPG -- Pessimistic RL without Ensemble
Figure 2 for Conservative DDPG -- Pessimistic RL without Ensemble
Figure 3 for Conservative DDPG -- Pessimistic RL without Ensemble
Figure 4 for Conservative DDPG -- Pessimistic RL without Ensemble
Viaarxiv icon

Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization

Add code
Bookmark button
Alert button
Feb 15, 2024
Yihan Du, Anna Winnicki, Gal Dalal, Shie Mannor, R. Srikant

Viaarxiv icon

Improving Token-Based World Models with Parallel Observation Prediction

Add code
Bookmark button
Alert button
Feb 13, 2024
Lior Cohen, Kaixin Wang, Bingyi Kang, Shie Mannor

Viaarxiv icon

SQT -- std $Q$-target

Add code
Bookmark button
Alert button
Feb 12, 2024
Nitsan Soffair, Dotan Di-Castro, Orly Avner, Shie Mannor

Viaarxiv icon

MinMaxMin $Q$-learning

Add code
Bookmark button
Alert button
Feb 12, 2024
Nitsan Soffair, Shie Mannor

Viaarxiv icon

Prospective Side Information for Latent MDPs

Add code
Bookmark button
Alert button
Oct 11, 2023
Jeongyeol Kwon, Yonathan Efroni, Shie Mannor, Constantine Caramanis

Viaarxiv icon

Optimization or Architecture: How to Hack Kalman Filtering

Add code
Bookmark button
Alert button
Oct 01, 2023
Ido Greenberg, Netanel Yannay, Shie Mannor

Viaarxiv icon