Zhang-Wei Hong

Curiosity-driven Red-teaming for Large Language Models

Feb 29, 2024
Zhang-Wei Hong, Idan Shenfeld, Tsun-Hsuan Wang, Yung-Sung Chuang, Aldo Pareja, James Glass, Akash Srivastava, Pulkit Agrawal

Neuro-Inspired Fragmentation and Recall to Overcome Catastrophic Forgetting in Curiosity

Oct 26, 2023
Jaedong Hwang, Zhang-Wei Hong, Eric Chen, Akhilan Boopathy, Pulkit Agrawal, Ila Fiete

Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets

Oct 12, 2023
Zhang-Wei Hong, Aviral Kumar, Sathwik Karnik, Abhishek Bhandwaldar, Akash Srivastava, Joni Pajarinen, Romain Laroche, Abhishek Gupta, Pulkit Agrawal

Parallel $Q$-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation

Jul 24, 2023
Zechu Li, Tao Chen, Zhang-Wei Hong, Anurag Ajay, Pulkit Agrawal

Neuro-Inspired Efficient Map Building via Fragmentation and Recall

Jul 11, 2023
Jaedong Hwang, Zhang-Wei Hong, Eric Chen, Akhilan Boopathy, Pulkit Agrawal, Ila Fiete

TGRL: An Algorithm for Teacher Guided Reinforcement Learning

Jul 06, 2023
Idan Shenfeld, Zhang-Wei Hong, Aviv Tamar, Pulkit Agrawal

Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting

Jun 22, 2023
Zhang-Wei Hong, Pulkit Agrawal, Rémi Tachet des Combes, Romain Laroche

Redeeming Intrinsic Rewards via Constrained Optimization

Nov 18, 2022
Eric Chen, Zhang-Wei Hong, Joni Pajarinen, Pulkit Agrawal

Model Predictive Control via On-Policy Imitation Learning

Oct 17, 2022
Kwangjun Ahn, Zakaria Mhammedi, Horia Mania, Zhang-Wei Hong, Ali Jadbabaie
