Leon Lang

The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret

Jun 22, 2024

When Your AIs Deceive You: Challenges with Partial Observability of Human Evaluators in Reward Learning

Mar 03, 2024

A Wigner-Eckart Theorem for Group Equivariant Convolution Kernels

Oct 22, 2020

Learning to Request Guidance in Emergent Communication

Dec 11, 2019