Alert button
Picture for Hengyuan Hu

Hengyuan Hu

Alert button

Imitation Bootstrapped Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 20, 2023
Hengyuan Hu, Suvir Mirchandani, Dorsa Sadigh

Viaarxiv icon

Toward Grounded Social Reasoning

Add code
Bookmark button
Alert button
Jun 14, 2023
Minae Kwon, Hengyuan Hu, Vivek Myers, Siddharth Karamcheti, Anca Dragan, Dorsa Sadigh

Figure 1 for Toward Grounded Social Reasoning
Figure 2 for Toward Grounded Social Reasoning
Figure 3 for Toward Grounded Social Reasoning
Figure 4 for Toward Grounded Social Reasoning
Viaarxiv icon

The Update Equivalence Framework for Decision-Time Planning

Add code
Bookmark button
Alert button
Apr 25, 2023
Samuel Sokota, Gabriele Farina, David J. Wu, Hengyuan Hu, Kevin A. Wang, J. Zico Kolter, Noam Brown

Figure 1 for The Update Equivalence Framework for Decision-Time Planning
Figure 2 for The Update Equivalence Framework for Decision-Time Planning
Figure 3 for The Update Equivalence Framework for Decision-Time Planning
Figure 4 for The Update Equivalence Framework for Decision-Time Planning
Viaarxiv icon

Language Instructed Reinforcement Learning for Human-AI Coordination

Add code
Bookmark button
Alert button
Apr 13, 2023
Hengyuan Hu, Dorsa Sadigh

Figure 1 for Language Instructed Reinforcement Learning for Human-AI Coordination
Figure 2 for Language Instructed Reinforcement Learning for Human-AI Coordination
Figure 3 for Language Instructed Reinforcement Learning for Human-AI Coordination
Figure 4 for Language Instructed Reinforcement Learning for Human-AI Coordination
Viaarxiv icon

Human-AI Coordination via Human-Regularized Search and Learning

Add code
Bookmark button
Alert button
Oct 11, 2022
Hengyuan Hu, David J Wu, Adam Lerer, Jakob Foerster, Noam Brown

Figure 1 for Human-AI Coordination via Human-Regularized Search and Learning
Figure 2 for Human-AI Coordination via Human-Regularized Search and Learning
Figure 3 for Human-AI Coordination via Human-Regularized Search and Learning
Viaarxiv icon

K-level Reasoning for Zero-Shot Coordination in Hanabi

Add code
Bookmark button
Alert button
Jul 14, 2022
Brandon Cui, Hengyuan Hu, Luis Pineda, Jakob N. Foerster

Figure 1 for K-level Reasoning for Zero-Shot Coordination in Hanabi
Figure 2 for K-level Reasoning for Zero-Shot Coordination in Hanabi
Figure 3 for K-level Reasoning for Zero-Shot Coordination in Hanabi
Figure 4 for K-level Reasoning for Zero-Shot Coordination in Hanabi
Viaarxiv icon

Scalable Online Planning via Reinforcement Learning Fine-Tuning

Add code
Bookmark button
Alert button
Sep 30, 2021
Arnaud Fickinger, Hengyuan Hu, Brandon Amos, Stuart Russell, Noam Brown

Figure 1 for Scalable Online Planning via Reinforcement Learning Fine-Tuning
Figure 2 for Scalable Online Planning via Reinforcement Learning Fine-Tuning
Figure 3 for Scalable Online Planning via Reinforcement Learning Fine-Tuning
Figure 4 for Scalable Online Planning via Reinforcement Learning Fine-Tuning
Viaarxiv icon

Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings

Add code
Bookmark button
Alert button
Jun 16, 2021
Hengyuan Hu, Adam Lerer, Noam Brown, Jakob Foerster

Figure 1 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Figure 2 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Figure 3 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Figure 4 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Viaarxiv icon

Off-Belief Learning

Add code
Bookmark button
Alert button
Mar 06, 2021
Hengyuan Hu, Adam Lerer, Brandon Cui, Luis Pineda, David Wu, Noam Brown, Jakob Foerster

Figure 1 for Off-Belief Learning
Figure 2 for Off-Belief Learning
Figure 3 for Off-Belief Learning
Figure 4 for Off-Belief Learning
Viaarxiv icon