Michael Luo

Balsa: Learning a Query Optimizer Without Expert Demonstrations

Jan 05, 2022
Zongheng Yang, Wei-Lin Chiang, Sifei Luan, Gautam Mittal, Michael Luo, Ion Stoica

Via arXiv

MESA: Offline Meta-RL for Safe Adaptation and Fault Tolerance

Dec 07, 2021
Michael Luo, Ashwin Balakrishna, Brijen Thananjeyan, Suraj Nair, Julian Ibarz, Jie Tan, Chelsea Finn, Ion Stoica, Ken Goldberg

Via arXiv

Discovering Non-monotonic Autoregressive Orderings with Variational Inference

Oct 27, 2021
Xuanlin Li, Brandon Trabucco, Dong Huk Park, Michael Luo, Sheng Shen, Trevor Darrell, Yang Gao

Via arXiv

Accelerating Quadratic Optimization with Reinforcement Learning

Jul 22, 2021
Jeffrey Ichnowski, Paras Jain, Bartolomeo Stellato, Goran Banjac, Michael Luo, Francesco Borrelli, Joseph E. Gonzalez, Ion Stoica, Ken Goldberg

Via arXiv

LazyDAgger: Reducing Context Switching in Interactive Imitation Learning

Mar 31, 2021
Ryan Hoque, Ashwin Balakrishna, Carl Putterman, Michael Luo, Daniel S. Brown, Daniel Seita, Brijen Thananjeyan, Ellen Novoseller, Ken Goldberg

Via arXiv

Distributed Reinforcement Learning is a Dataflow Problem

Dec 03, 2020
Eric Liang, Zhanghao Wu, Michael Luo, Sven Mika, Ion Stoica

Via arXiv

Connecting Context-specific Adaptation in Humans to Meta-learning

Dec 01, 2020
Rachit Dubey, Erin Grant, Michael Luo, Karthik Narasimhan, Thomas Griffiths

Via arXiv

Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones

Oct 29, 2020
Brijen Thananjeyan, Ashwin Balakrishna, Suraj Nair, Michael Luo, Krishnan Srinivasan, Minho Hwang, Joseph E. Gonzalez, Julian Ibarz, Chelsea Finn, Ken Goldberg

Via arXiv

IMPACT: Importance Weighted Asynchronous Architectures with Clipped Target Networks

Jan 23, 2020
Michael Luo, Jiahao Yao, Richard Liaw, Eric Liang, Ion Stoica

Via arXiv