Picture for Ilija Bogunovic

Ilija Bogunovic

Sample Efficient Reinforcement Learning from Human Feedback via Active Exploration

Add code
Dec 01, 2023
Viaarxiv icon

REDUCR: Robust Data Downsampling Using Class Priority Reweighting

Add code
Dec 01, 2023
Viaarxiv icon

Robust Best-arm Identification in Linear Bandits

Add code
Nov 08, 2023
Viaarxiv icon

Distributionally Robust Model-based Reinforcement Learning with Large State Spaces

Add code
Sep 05, 2023
Viaarxiv icon

Safe Model-Based Multi-Agent Mean-Field Reinforcement Learning

Add code
Jun 29, 2023
Viaarxiv icon

Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning

Add code
Feb 08, 2023
Viaarxiv icon

Near-optimal Policy Identification in Active Reinforcement Learning

Add code
Dec 19, 2022
Viaarxiv icon

Movement Penalized Bayesian Optimization with Application to Wind Energy Systems

Add code
Oct 14, 2022
Figure 1 for Movement Penalized Bayesian Optimization with Application to Wind Energy Systems
Figure 2 for Movement Penalized Bayesian Optimization with Application to Wind Energy Systems
Figure 3 for Movement Penalized Bayesian Optimization with Application to Wind Energy Systems
Figure 4 for Movement Penalized Bayesian Optimization with Application to Wind Energy Systems
Viaarxiv icon

Graph Neural Network Bandits

Add code
Jul 13, 2022
Figure 1 for Graph Neural Network Bandits
Figure 2 for Graph Neural Network Bandits
Figure 3 for Graph Neural Network Bandits
Figure 4 for Graph Neural Network Bandits
Viaarxiv icon

A Robust Phased Elimination Algorithm for Corruption-Tolerant Gaussian Process Bandits

Add code
Feb 03, 2022
Figure 1 for A Robust Phased Elimination Algorithm for Corruption-Tolerant Gaussian Process Bandits
Figure 2 for A Robust Phased Elimination Algorithm for Corruption-Tolerant Gaussian Process Bandits
Figure 3 for A Robust Phased Elimination Algorithm for Corruption-Tolerant Gaussian Process Bandits
Figure 4 for A Robust Phased Elimination Algorithm for Corruption-Tolerant Gaussian Process Bandits
Viaarxiv icon