Alert button
Picture for Akshay Krishnamurthy

Akshay Krishnamurthy

Alert button

Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting

Add code
Bookmark button
Alert button
Feb 05, 2019
Akshay Krishnamurthy, John Langford, Aleksandrs Slivkins, Chicheng Zhang

Figure 1 for Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting
Viaarxiv icon

Provably efficient RL with Rich Observations via Latent State Decoding

Add code
Bookmark button
Alert button
Jan 25, 2019
Simon S. Du, Akshay Krishnamurthy, Nan Jiang, Alekh Agarwal, Miroslav Dudík, John Langford

Figure 1 for Provably efficient RL with Rich Observations via Latent State Decoding
Viaarxiv icon

Model-Based Reinforcement Learning in Contextual Decision Processes

Add code
Bookmark button
Alert button
Nov 21, 2018
Wen Sun, Nan Jiang, Akshay Krishnamurthy, Alekh Agarwal, John Langford

Figure 1 for Model-Based Reinforcement Learning in Contextual Decision Processes
Viaarxiv icon

Contextual bandits with surrogate losses: Margin bounds and efficient algorithms

Add code
Bookmark button
Alert button
Nov 04, 2018
Dylan J. Foster, Akshay Krishnamurthy

Figure 1 for Contextual bandits with surrogate losses: Margin bounds and efficient algorithms
Viaarxiv icon

On Oracle-Efficient PAC RL with Rich Observations

Add code
Bookmark button
Alert button
Oct 31, 2018
Christoph Dann, Nan Jiang, Akshay Krishnamurthy, Alekh Agarwal, John Langford, Robert E. Schapire

Figure 1 for On Oracle-Efficient PAC RL with Rich Observations
Viaarxiv icon

Semiparametric Contextual Bandits

Add code
Bookmark button
Alert button
Jul 16, 2018
Akshay Krishnamurthy, Zhiwei Steven Wu, Vasilis Syrgkanis

Figure 1 for Semiparametric Contextual Bandits
Viaarxiv icon

Myopic Bayesian Design of Experiments via Posterior Sampling and Probabilistic Programming

Add code
Bookmark button
Alert button
May 25, 2018
Kirthevasan Kandasamy, Willie Neiswanger, Reed Zhang, Akshay Krishnamurthy, Jeff Schneider, Barnabas Poczos

Figure 1 for Myopic Bayesian Design of Experiments via Posterior Sampling and Probabilistic Programming
Viaarxiv icon

Disagreement-based combinatorial pure exploration: Efficient algorithms and an analysis with localization

Add code
Bookmark button
Alert button
Nov 30, 2017
Tongyi Cao, Akshay Krishnamurthy

Figure 1 for Disagreement-based combinatorial pure exploration: Efficient algorithms and an analysis with localization
Viaarxiv icon

Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 15, 2017
Rajarshi Das, Shehzaad Dhuliawala, Manzil Zaheer, Luke Vilnis, Ishan Durugkar, Akshay Krishnamurthy, Alex Smola, Andrew McCallum

Figure 1 for Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement Learning
Figure 2 for Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement Learning
Figure 3 for Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement Learning
Figure 4 for Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement Learning
Viaarxiv icon

Active Learning for Cost-Sensitive Classification

Add code
Bookmark button
Alert button
Nov 13, 2017
Akshay Krishnamurthy, Alekh Agarwal, Tzu-Kuo Huang, Hal Daume III, John Langford

Figure 1 for Active Learning for Cost-Sensitive Classification
Figure 2 for Active Learning for Cost-Sensitive Classification
Figure 3 for Active Learning for Cost-Sensitive Classification
Figure 4 for Active Learning for Cost-Sensitive Classification
Viaarxiv icon