Picture for Aldo Pacchiano

Aldo Pacchiano

Best of Both Worlds Model Selection

Add code
Jun 29, 2022
Viaarxiv icon

Joint Representation Training in Sequential Tasks with Shared Structure

Add code
Jun 24, 2022
Viaarxiv icon

Online Nonsubmodular Minimization with Delayed Costs: From Full Information to Bandit Feedback

Add code
May 15, 2022
Figure 1 for Online Nonsubmodular Minimization with Delayed Costs: From Full Information to Bandit Feedback
Figure 2 for Online Nonsubmodular Minimization with Delayed Costs: From Full Information to Bandit Feedback
Figure 3 for Online Nonsubmodular Minimization with Delayed Costs: From Full Information to Bandit Feedback
Viaarxiv icon

Meta Learning MDPs with Linear Transition Models

Add code
Jan 21, 2022
Viaarxiv icon

Neural Pseudo-Label Optimism for the Bank Loan Problem

Add code
Dec 03, 2021
Figure 1 for Neural Pseudo-Label Optimism for the Bank Loan Problem
Figure 2 for Neural Pseudo-Label Optimism for the Bank Loan Problem
Viaarxiv icon

An Instance-Dependent Analysis for the Cooperative Multi-Player Multi-Armed Bandit

Add code
Nov 08, 2021
Figure 1 for An Instance-Dependent Analysis for the Cooperative Multi-Player Multi-Armed Bandit
Figure 2 for An Instance-Dependent Analysis for the Cooperative Multi-Player Multi-Armed Bandit
Figure 3 for An Instance-Dependent Analysis for the Cooperative Multi-Player Multi-Armed Bandit
Viaarxiv icon

Dueling RL: Reinforcement Learning with Trajectory Preferences

Add code
Nov 08, 2021
Viaarxiv icon

Towards an Understanding of Default Policies in Multitask Policy Optimization

Add code
Nov 06, 2021
Figure 1 for Towards an Understanding of Default Policies in Multitask Policy Optimization
Figure 2 for Towards an Understanding of Default Policies in Multitask Policy Optimization
Figure 3 for Towards an Understanding of Default Policies in Multitask Policy Optimization
Figure 4 for Towards an Understanding of Default Policies in Multitask Policy Optimization
Viaarxiv icon

Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection

Add code
Oct 27, 2021
Figure 1 for Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
Figure 2 for Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
Figure 3 for Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
Figure 4 for Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
Viaarxiv icon

Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity

Add code
Jun 15, 2021
Figure 1 for Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity
Figure 2 for Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity
Figure 3 for Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity
Figure 4 for Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity
Viaarxiv icon