Picture for Alessio Russo

Alessio Russo

Pure Exploration with Feedback Graphs

Add code
Mar 10, 2025
Figure 1 for Pure Exploration with Feedback Graphs
Figure 2 for Pure Exploration with Feedback Graphs
Figure 3 for Pure Exploration with Feedback Graphs
Figure 4 for Pure Exploration with Feedback Graphs
Viaarxiv icon

Adaptive Exploration for Multi-Reward Multi-Policy Evaluation

Add code
Feb 04, 2025
Figure 1 for Adaptive Exploration for Multi-Reward Multi-Policy Evaluation
Figure 2 for Adaptive Exploration for Multi-Reward Multi-Policy Evaluation
Figure 3 for Adaptive Exploration for Multi-Reward Multi-Policy Evaluation
Figure 4 for Adaptive Exploration for Multi-Reward Multi-Policy Evaluation
Viaarxiv icon

Achieving $\widetilde{\mathcal{O}}(\sqrt{T})$ Regret in Average-Reward POMDPs with Known Observation Models

Add code
Jan 30, 2025
Figure 1 for Achieving $\widetilde{\mathcal{O}}(\sqrt{T})$ Regret in Average-Reward POMDPs with Known Observation Models
Figure 2 for Achieving $\widetilde{\mathcal{O}}(\sqrt{T})$ Regret in Average-Reward POMDPs with Known Observation Models
Figure 3 for Achieving $\widetilde{\mathcal{O}}(\sqrt{T})$ Regret in Average-Reward POMDPs with Known Observation Models
Figure 4 for Achieving $\widetilde{\mathcal{O}}(\sqrt{T})$ Regret in Average-Reward POMDPs with Known Observation Models
Viaarxiv icon

Explainable Reinforcement Learning via Temporal Policy Decomposition

Add code
Jan 07, 2025
Figure 1 for Explainable Reinforcement Learning via Temporal Policy Decomposition
Figure 2 for Explainable Reinforcement Learning via Temporal Policy Decomposition
Figure 3 for Explainable Reinforcement Learning via Temporal Policy Decomposition
Figure 4 for Explainable Reinforcement Learning via Temporal Policy Decomposition
Viaarxiv icon

Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation

Add code
Oct 30, 2024
Figure 1 for Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation
Figure 2 for Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation
Figure 3 for Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation
Figure 4 for Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation
Viaarxiv icon

Efficient Learning of POMDPs with Known Observation Model in Average-Reward Setting

Add code
Oct 02, 2024
Viaarxiv icon

Fair Best Arm Identification with Fixed Confidence

Add code
Aug 30, 2024
Figure 1 for Fair Best Arm Identification with Fixed Confidence
Figure 2 for Fair Best Arm Identification with Fixed Confidence
Figure 3 for Fair Best Arm Identification with Fixed Confidence
Figure 4 for Fair Best Arm Identification with Fixed Confidence
Viaarxiv icon

Model-Free Active Exploration in Reinforcement Learning

Add code
Jun 30, 2024
Viaarxiv icon

Conformal Off-Policy Evaluation in Markov Decision Processes

Add code
Apr 05, 2023
Viaarxiv icon

On the Sample Complexity of Representation Learning in Multi-task Bandits with Global and Local structure

Add code
Nov 28, 2022
Viaarxiv icon