Akshay Krishnamurthy

Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient

Oct 13, 2022
Yuda Song, Yifei Zhou, Ayush Sekhari, J. Andrew Bagnell, Akshay Krishnamurthy, Wen Sun

Via arXiv

Guaranteed Discovery of Controllable Latent States with Multi-Step Inverse Models

Jul 17, 2022
Alex Lamb, Riashat Islam, Yonathan Efroni, Aniket Didolkar, Dipendra Misra, Dylan Foster, Lekan Molu, Rajan Chari, Akshay Krishnamurthy, John Langford

Via arXiv

On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL

Jun 21, 2022
Jinglin Chen, Aditya Modi, Akshay Krishnamurthy, Nan Jiang, Alekh Agarwal

Via arXiv

Sample-Efficient Reinforcement Learning in the Presence of Exogenous Information

Jun 09, 2022
Yonathan Efroni, Dylan J. Foster, Dipendra Misra, Akshay Krishnamurthy, John Langford

Via arXiv

A Sharp Characterization of Linear Estimators for Offline Policy Evaluation

Mar 08, 2022
Juan C. Perdomo, Akshay Krishnamurthy, Peter Bartlett, Sham Kakade

Via arXiv

Understanding Contrastive Learning Requires Incorporating Inductive Biases

Feb 28, 2022
Nikunj Saunshi, Jordan Ash, Surbhi Goel, Dipendra Misra, Cyril Zhang, Sanjeev Arora, Sham Kakade, Akshay Krishnamurthy

Via arXiv

Provable Reinforcement Learning with a Short-Term Memory

Feb 08, 2022
Yonathan Efroni, Chi Jin, Akshay Krishnamurthy, Sobhan Miryoosefi

Via arXiv

Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability

Nov 24, 2021
Aadirupa Saha, Akshay Krishnamurthy

Via arXiv

Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation

Nov 21, 2021
Dylan J. Foster, Akshay Krishnamurthy, David Simchi-Levi, Yunzong Xu

Via arXiv

Universal and data-adaptive algorithms for model selection in linear contextual bandits

Nov 08, 2021
Vidya Muthukumar, Akshay Krishnamurthy

Via arXiv