
Zhang-Wei Hong

ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization

Oct 17, 2024
Via arXiv

Random Latent Exploration for Deep Reinforcement Learning

Jul 18, 2024
Via arXiv

ROER: Regularized Optimal Experience Replay

Jul 04, 2024
Via arXiv

Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language Models

Jun 06, 2024
Via arXiv

Curiosity-driven Red-teaming for Large Language Models

Feb 29, 2024
Via arXiv

Neuro-Inspired Fragmentation and Recall to Overcome Catastrophic Forgetting in Curiosity

Oct 26, 2023
Via arXiv

Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets

Oct 12, 2023
Via arXiv

Parallel $Q$-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation

Jul 24, 2023
Via arXiv

Neuro-Inspired Efficient Map Building via Fragmentation and Recall

Jul 11, 2023
Via arXiv

TGRL: An Algorithm for Teacher Guided Reinforcement Learning

Jul 06, 2023
Via arXiv