Aldo Pacchiano

Online Nonsubmodular Minimization with Delayed Costs: From Full Information to Bandit Feedback

May 15, 2022
Tianyi Lin, Aldo Pacchiano, Yaodong Yu, Michael I. Jordan

Via arXiv

Meta Learning MDPs with Linear Transition Models

Jan 21, 2022
Robert Müller, Aldo Pacchiano

Via arXiv

Neural Pseudo-Label Optimism for the Bank Loan Problem

Dec 03, 2021
Aldo Pacchiano, Shaun Singh, Edward Chou, Alexander C. Berg, Jakob Foerster

Via arXiv

An Instance-Dependent Analysis for the Cooperative Multi-Player Multi-Armed Bandit

Nov 08, 2021
Aldo Pacchiano, Peter Bartlett, Michael I. Jordan

Via arXiv

Dueling RL: Reinforcement Learning with Trajectory Preferences

Nov 08, 2021
Aldo Pacchiano, Aadirupa Saha, Jonathan Lee

Via arXiv

Towards an Understanding of Default Policies in Multitask Policy Optimization

Nov 06, 2021
Ted Moskovitz, Michael Arbel, Jack Parker-Holder, Aldo Pacchiano

Via arXiv

Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection

Oct 27, 2021
Matteo Papini, Andrea Tirinzoni, Aldo Pacchiano, Marcello Restelli, Alessandro Lazaric, Matteo Pirotta

Via arXiv

Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity

Jun 15, 2021
Dhruv Malik, Aldo Pacchiano, Vishwak Srinivasan, Yuanzhi Li

Via arXiv

On the Theory of Reinforcement Learning with Once-per-Episode Feedback

Jun 07, 2021
Niladri S. Chatterji, Aldo Pacchiano, Peter L. Bartlett, Michael I. Jordan

Via arXiv