Picture for Stephen Chung

Stephen Chung

The Station: An Open-World Environment for AI-Driven Discovery

Add code
Nov 09, 2025
Figure 1 for The Station: An Open-World Environment for AI-Driven Discovery
Figure 2 for The Station: An Open-World Environment for AI-Driven Discovery
Figure 3 for The Station: An Open-World Environment for AI-Driven Discovery
Figure 4 for The Station: An Open-World Environment for AI-Driven Discovery
Viaarxiv icon

Thinker: Learning to Think Fast and Slow

Add code
May 27, 2025
Figure 1 for Thinker: Learning to Think Fast and Slow
Figure 2 for Thinker: Learning to Think Fast and Slow
Figure 3 for Thinker: Learning to Think Fast and Slow
Figure 4 for Thinker: Learning to Think Fast and Slow
Viaarxiv icon

Learning from Peers in Reasoning Models

Add code
May 12, 2025
Viaarxiv icon

Interpreting Emergent Planning in Model-Free Reinforcement Learning

Add code
Apr 02, 2025
Viaarxiv icon

Handling Delay in Real-Time Reinforcement Learning

Add code
Mar 30, 2025
Viaarxiv icon

Learning from Failures in Multi-Attempt Reinforcement Learning

Add code
Mar 04, 2025
Viaarxiv icon

Predicting Future Actions of Reinforcement Learning Agents

Add code
Oct 29, 2024
Figure 1 for Predicting Future Actions of Reinforcement Learning Agents
Figure 2 for Predicting Future Actions of Reinforcement Learning Agents
Figure 3 for Predicting Future Actions of Reinforcement Learning Agents
Figure 4 for Predicting Future Actions of Reinforcement Learning Agents
Viaarxiv icon

Thinker: Learning to Plan and Act

Add code
Jul 27, 2023
Viaarxiv icon

Unbiased Weight Maximization

Add code
Jul 25, 2023
Figure 1 for Unbiased Weight Maximization
Figure 2 for Unbiased Weight Maximization
Figure 3 for Unbiased Weight Maximization
Figure 4 for Unbiased Weight Maximization
Viaarxiv icon

Structural Credit Assignment with Coordinated Exploration

Add code
Jul 25, 2023
Figure 1 for Structural Credit Assignment with Coordinated Exploration
Figure 2 for Structural Credit Assignment with Coordinated Exploration
Figure 3 for Structural Credit Assignment with Coordinated Exploration
Figure 4 for Structural Credit Assignment with Coordinated Exploration
Viaarxiv icon