Cameron Allen

BXRL: Behavior-Explainable Reinforcement Learning

Mar 24, 2026
Via arXiv

Truthfulness Despite Weak Supervision: Evaluating and Training LLMs Using Peer Prediction

Jan 28, 2026
Via arXiv

Focused Skill Discovery: Learning to Control Specific State Variables while Minimizing Side Effects

Oct 06, 2025
Via arXiv

Benchmarking Partial Observability in Reinforcement Learning with a Suite of Memory-Improvable Domains

Jul 31, 2025
Via arXiv

Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy

Jul 10, 2024
Via arXiv

Evidence of Learned Look-Ahead in a Chess-Playing Neural Network

Jun 02, 2024
Via arXiv

Characterizing the Action-Generalization Gap in Deep Q-Learning

May 11, 2022
Via arXiv

Coarse-Grained Smoothness for RL in Metric Spaces

Oct 23, 2021
Via arXiv

Bad-Policy Density: A Measure of Reinforcement Learning Hardness

Oct 07, 2021
Via arXiv

Learning Markov State Abstractions for Deep Reinforcement Learning

Jun 08, 2021
Via arXiv