Yishay Mansour

School of Computer Science, Tel Aviv University; Google Research, Tel Aviv

Dueling Convex Optimization with General Preferences

Sep 27, 2022
Via arXiv

Regret Minimization and Convergence to Equilibria in General-sum Markov Games

Aug 08, 2022
Via arXiv

Optimism in Face of a Context: Regret Guarantees for Stochastic Contextual MDP

Jul 22, 2022
Via arXiv

Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation

Jun 19, 2022
Via arXiv

There is no Accuracy-Interpretability Tradeoff in Reinforcement Learning for Mazes

Jun 09, 2022
Via arXiv

What killed the Convex Booster ?

May 25, 2022
Via arXiv

Strategizing against Learners in Bayesian Games

May 17, 2022
Via arXiv

Modeling Attrition in Recommender Systems with Departing Bandits

Mar 25, 2022
Via arXiv

Learning Efficiently Function Approximation for Contextual MDP

Mar 02, 2022
Via arXiv

Benign Underfitting of Stochastic Gradient Descent

Mar 01, 2022
Via arXiv