
Vincent Zhuang


Learning to Learn Faster from Human Feedback with Language Model Predictive Control

Feb 18, 2024
Jacky Liang, Fei Xia, Wenhao Yu, Andy Zeng, Montserrat Gonzalez Arenas, Maria Attarian, Maria Bauza, Matthew Bennice, Alex Bewley, Adil Dostmohamed, Chuyuan Kelly Fu, Nimrod Gileadi, Marissa Giustina, Keerthana Gopalakrishnan, Leonard Hasenclever, Jan Humplik, Jasmine Hsu, Nikhil Joshi, Ben Jyenis, Chase Kew, Sean Kirmani, Tsang-Wei Edward Lee, Kuang-Huei Lee, Assaf Hurwitz Michaely, Joss Moore, Ken Oslund, Dushyant Rao, Allen Ren, Baruch Tabanpour, Quan Vuong, Ayzaan Wahid, Ted Xiao, Ying Xu, Vincent Zhuang, Peng Xu, Erik Frey, Ken Caluwaerts, Tingnan Zhang, Brian Ichter, Jonathan Tompson, Leila Takayama, Vincent Vanhoucke, Izhak Shafran, Maja Mataric, Dorsa Sadigh, Nicolas Heess, Kanishka Rao, Nik Stewart, Jie Tan, Carolina Parada


Kepler: Robust Learning for Faster Parametric Query Optimization

Jun 11, 2023
Lyric Doshi, Vincent Zhuang, Gaurav Jain, Ryan Marcus, Haoyu Huang, Deniz Altinbüken, Eugene Brevdo, Campbell Fraser


Barkour: Benchmarking Animal-level Agility with Quadruped Robots

May 24, 2023
Ken Caluwaerts, Atil Iscen, J. Chase Kew, Wenhao Yu, Tingnan Zhang, Daniel Freeman, Kuang-Huei Lee, Lisa Lee, Stefano Saliceti, Vincent Zhuang, Nathan Batchelor, Steven Bohez, Federico Casarini, Jose Enrique Chen, Omar Cortes, Erwin Coumans, Adil Dostmohamed, Gabriel Dulac-Arnold, Alejandro Escontrela, Erik Frey, Roland Hafner, Deepali Jain, Bauyrjan Jyenis, Yuheng Kuang, Edward Lee, Linda Luu, Ofir Nachum, Ken Oslund, Jason Powell, Diego Reyes, Francesco Romano, Feresteh Sadeghi, Ron Sloat, Baruch Tabanpour, Daniel Zheng, Michael Neunert, Raia Hadsell, Nicolas Heess, Francesco Nori, Jeff Seto, Carolina Parada, Vikas Sindhwani, Vincent Vanhoucke, Jie Tan


No-Regret Reinforcement Learning with Heavy-Tailed Rewards

Feb 25, 2021
Vincent Zhuang, Yanan Sui


Stagewise Safe Bayesian Optimization with Gaussian Processes

Jun 20, 2018
Yanan Sui, Vincent Zhuang, Joel W. Burdick, Yisong Yue


Multi-dueling Bandits with Dependent Arms

Apr 29, 2017
Yanan Sui, Vincent Zhuang, Joel W. Burdick, Yisong Yue
