Alert button
Picture for Matteo Papini

Matteo Papini

Alert button

Optimisic Information Directed Sampling

Add code
Bookmark button
Alert button
Feb 23, 2024
Gergely Neu, Matteo Papini, Ludovic Schwartz

Viaarxiv icon

No-Regret Reinforcement Learning in Smooth MDPs

Add code
Bookmark button
Alert button
Feb 06, 2024
Davide Maran, Alberto Maria Metelli, Matteo Papini, Marcello Restell

Viaarxiv icon

Importance-Weighted Offline Learning Done Right

Add code
Bookmark button
Alert button
Sep 27, 2023
Germano Gabbianelli, Gergely Neu, Matteo Papini

Viaarxiv icon

Offline Primal-Dual Reinforcement Learning for Linear MDPs

Add code
Bookmark button
Alert button
May 22, 2023
Germano Gabbianelli, Gergely Neu, Nneka Okolo, Matteo Papini

Figure 1 for Offline Primal-Dual Reinforcement Learning for Linear MDPs
Viaarxiv icon

Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees

Add code
Bookmark button
Alert button
Oct 24, 2022
Andrea Tirinzoni, Matteo Papini, Ahmed Touati, Alessandro Lazaric, Matteo Pirotta

Figure 1 for Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Figure 2 for Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Figure 3 for Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Figure 4 for Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Viaarxiv icon

Online Learning with Off-Policy Feedback

Add code
Bookmark button
Alert button
Jul 18, 2022
Germano Gabbianelli, Matteo Papini, Gergely Neu

Figure 1 for Online Learning with Off-Policy Feedback
Viaarxiv icon

Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits

Add code
Bookmark button
Alert button
May 27, 2022
Gergely Neu, Julia Olkhovskaya, Matteo Papini, Ludovic Schwartz

Viaarxiv icon

Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection

Add code
Bookmark button
Alert button
Oct 27, 2021
Matteo Papini, Andrea Tirinzoni, Aldo Pacchiano, Marcello Restelli, Alessandro Lazaric, Matteo Pirotta

Figure 1 for Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
Figure 2 for Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
Figure 3 for Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
Figure 4 for Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
Viaarxiv icon

Leveraging Good Representations in Linear Contextual Bandits

Add code
Bookmark button
Alert button
Apr 08, 2021
Matteo Papini, Andrea Tirinzoni, Marcello Restelli, Alessandro Lazaric, Matteo Pirotta

Figure 1 for Leveraging Good Representations in Linear Contextual Bandits
Figure 2 for Leveraging Good Representations in Linear Contextual Bandits
Figure 3 for Leveraging Good Representations in Linear Contextual Bandits
Figure 4 for Leveraging Good Representations in Linear Contextual Bandits
Viaarxiv icon

Policy Optimization as Online Learning with Mediator Feedback

Add code
Bookmark button
Alert button
Dec 15, 2020
Alberto Maria Metelli, Matteo Papini, Pierluca D'Oro, Marcello Restelli

Figure 1 for Policy Optimization as Online Learning with Mediator Feedback
Figure 2 for Policy Optimization as Online Learning with Mediator Feedback
Figure 3 for Policy Optimization as Online Learning with Mediator Feedback
Figure 4 for Policy Optimization as Online Learning with Mediator Feedback
Viaarxiv icon