Vincent Zhuang

Learning to Learn Faster from Human Feedback with Language Model Predictive Control

Feb 18, 2024
Via arXiv

Kepler: Robust Learning for Faster Parametric Query Optimization

Jun 11, 2023
Via arXiv

Barkour: Benchmarking Animal-level Agility with Quadruped Robots

May 24, 2023
Via arXiv

No-Regret Reinforcement Learning with Heavy-Tailed Rewards

Feb 25, 2021
Via arXiv

Stagewise Safe Bayesian Optimization with Gaussian Processes

Jun 20, 2018
Via arXiv

Multi-dueling Bandits with Dependent Arms

Apr 29, 2017
Via arXiv