Alert button
Picture for John Langford

John Langford

Alert button

Efficient Contextual Bandits with Continuous Actions

Add code
Bookmark button
Alert button
Jun 10, 2020
Maryam Majzoubi, Chicheng Zhang, Rajan Chari, Akshay Krishnamurthy, John Langford, Aleksandrs Slivkins

Figure 1 for Efficient Contextual Bandits with Continuous Actions
Figure 2 for Efficient Contextual Bandits with Continuous Actions
Figure 3 for Efficient Contextual Bandits with Continuous Actions
Figure 4 for Efficient Contextual Bandits with Continuous Actions
Viaarxiv icon

Federated Residual Learning

Add code
Bookmark button
Alert button
Mar 28, 2020
Alekh Agarwal, John Langford, Chen-Yu Wei

Figure 1 for Federated Residual Learning
Figure 2 for Federated Residual Learning
Figure 3 for Federated Residual Learning
Figure 4 for Federated Residual Learning
Viaarxiv icon

Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 13, 2019
Dipendra Misra, Mikael Henaff, Akshay Krishnamurthy, John Langford

Figure 1 for Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning
Figure 2 for Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning
Figure 3 for Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning
Figure 4 for Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning
Viaarxiv icon

Empirical Likelihood for Contextual Bandits

Add code
Bookmark button
Alert button
Jun 21, 2019
Nikos Karampatziakis, John Langford, Paul Mineiro

Figure 1 for Empirical Likelihood for Contextual Bandits
Figure 2 for Empirical Likelihood for Contextual Bandits
Figure 3 for Empirical Likelihood for Contextual Bandits
Figure 4 for Empirical Likelihood for Contextual Bandits
Viaarxiv icon

Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds

Add code
Bookmark button
Alert button
Jun 09, 2019
Jordan T. Ash, Chicheng Zhang, Akshay Krishnamurthy, John Langford, Alekh Agarwal

Figure 1 for Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds
Figure 2 for Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds
Figure 3 for Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds
Figure 4 for Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds
Viaarxiv icon

Efficient Forward Architecture Search

Add code
Bookmark button
Alert button
May 31, 2019
Hanzhang Hu, John Langford, Rich Caruana, Saurajit Mukherjee, Eric Horvitz, Debadeepta Dey

Figure 1 for Efficient Forward Architecture Search
Figure 2 for Efficient Forward Architecture Search
Figure 3 for Efficient Forward Architecture Search
Figure 4 for Efficient Forward Architecture Search
Viaarxiv icon

Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting

Add code
Bookmark button
Alert button
Feb 05, 2019
Akshay Krishnamurthy, John Langford, Aleksandrs Slivkins, Chicheng Zhang

Figure 1 for Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting
Viaarxiv icon

Provably efficient RL with Rich Observations via Latent State Decoding

Add code
Bookmark button
Alert button
Jan 25, 2019
Simon S. Du, Akshay Krishnamurthy, Nan Jiang, Alekh Agarwal, Miroslav Dudík, John Langford

Figure 1 for Provably efficient RL with Rich Observations via Latent State Decoding
Viaarxiv icon

Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback

Add code
Bookmark button
Alert button
Jan 02, 2019
Chicheng Zhang, Alekh Agarwal, Hal Daumé III, John Langford, Sahand N Negahban

Figure 1 for Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback
Figure 2 for Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback
Figure 3 for Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback
Viaarxiv icon