Picture for Stephen Zhao

Stephen Zhao

Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo

Add code
Apr 26, 2024
Viaarxiv icon

Proximal Learning With Opponent-Learning Awareness

Add code
Oct 18, 2022
Figure 1 for Proximal Learning With Opponent-Learning Awareness
Figure 2 for Proximal Learning With Opponent-Learning Awareness
Figure 3 for Proximal Learning With Opponent-Learning Awareness
Figure 4 for Proximal Learning With Opponent-Learning Awareness
Viaarxiv icon

Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning

Add code
Jul 06, 2020
Figure 1 for Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning
Figure 2 for Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning
Figure 3 for Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning
Figure 4 for Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning
Viaarxiv icon