Picture for Aldo Pacchiano

Aldo Pacchiano

On the Theory of Reinforcement Learning with Once-per-Episode Feedback

Add code
Jun 07, 2021
Figure 1 for On the Theory of Reinforcement Learning with Once-per-Episode Feedback
Viaarxiv icon

Parallelizing Contextual Linear Bandits

Add code
May 21, 2021
Figure 1 for Parallelizing Contextual Linear Bandits
Figure 2 for Parallelizing Contextual Linear Bandits
Figure 3 for Parallelizing Contextual Linear Bandits
Figure 4 for Parallelizing Contextual Linear Bandits
Viaarxiv icon

Near Optimal Policy Optimization via REPS

Add code
Mar 17, 2021
Viaarxiv icon

Unlocking Pixels for Reinforcement Learning via Implicit Attention

Add code
Mar 04, 2021
Figure 1 for Unlocking Pixels for Reinforcement Learning via Implicit Attention
Figure 2 for Unlocking Pixels for Reinforcement Learning via Implicit Attention
Figure 3 for Unlocking Pixels for Reinforcement Learning via Implicit Attention
Figure 4 for Unlocking Pixels for Reinforcement Learning via Implicit Attention
Viaarxiv icon

Deep Reinforcement Learning with Dynamic Optimism

Add code
Feb 09, 2021
Figure 1 for Deep Reinforcement Learning with Dynamic Optimism
Figure 2 for Deep Reinforcement Learning with Dynamic Optimism
Figure 3 for Deep Reinforcement Learning with Dynamic Optimism
Figure 4 for Deep Reinforcement Learning with Dynamic Optimism
Viaarxiv icon

ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning

Add code
Jan 19, 2021
Figure 1 for ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning
Figure 2 for ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning
Figure 3 for ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning
Figure 4 for ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning
Viaarxiv icon

Fairness with Continuous Optimal Transport

Add code
Jan 06, 2021
Figure 1 for Fairness with Continuous Optimal Transport
Figure 2 for Fairness with Continuous Optimal Transport
Figure 3 for Fairness with Continuous Optimal Transport
Figure 4 for Fairness with Continuous Optimal Transport
Viaarxiv icon

Regret Bound Balancing and Elimination for Model Selection in Bandits and RL

Add code
Dec 24, 2020
Figure 1 for Regret Bound Balancing and Elimination for Model Selection in Bandits and RL
Figure 2 for Regret Bound Balancing and Elimination for Model Selection in Bandits and RL
Viaarxiv icon

Online Model Selection for Reinforcement Learning with Function Approximation

Add code
Nov 19, 2020
Figure 1 for Online Model Selection for Reinforcement Learning with Function Approximation
Viaarxiv icon

Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian

Add code
Nov 12, 2020
Figure 1 for Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian
Figure 2 for Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian
Figure 3 for Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian
Figure 4 for Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian
Viaarxiv icon