Teodor V. Marinov

Offline Imitation Learning from Multiple Baselines with Applications to Compiler Optimization

Mar 28, 2024
Teodor V. Marinov, Alekh Agarwal, Mircea Trofin

A Mechanism for Sample-Efficient In-Context Learning for Sparse Retrieval Tasks

May 26, 2023
Jacob Abernethy, Alekh Agarwal, Teodor V. Marinov, Manfred K. Warmuth

Leveraging User-Triggered Supervision in Contextual Bandits

Feb 07, 2023
Alekh Agarwal, Claudio Gentile, Teodor V. Marinov

Stochastic Online Learning with Feedback Graphs: Finite-Time and Asymptotic Optimality

Jun 20, 2022
Teodor V. Marinov, Mehryar Mohri, Julian Zimmert

The Pareto Frontier of model selection for general Contextual Bandits

Oct 25, 2021
Teodor V. Marinov, Julian Zimmert

Beyond Value-Function Gaps: Improved Instance-Dependent Regret Bounds for Episodic Reinforcement Learning

Jul 02, 2021
Christoph Dann, Teodor V. Marinov, Mehryar Mohri, Julian Zimmert

Corralling Stochastic Bandit Algorithms

Jun 28, 2020
Raman Arora, Teodor V. Marinov, Mehryar Mohri

Private Stochastic Convex Optimization: Efficient Algorithms for Non-smooth Objectives

Feb 22, 2020
Raman Arora, Teodor V. Marinov, Enayat Ullah

Bandits with Feedback Graphs and Switching Costs

Jul 29, 2019
Raman Arora, Teodor V. Marinov, Mehryar Mohri
