Guy Davidson

SAGE-Eval: Evaluating LLMs for Systematic Generalizations of Safety Facts

May 27, 2025, via arXiv

Do different prompting methods yield a common task representation in language models?

May 17, 2025, via arXiv

Goals as Reward-Producing Programs

May 21, 2024, via arXiv

Toward Human-AI Alignment in Large-Scale Multi-Player Games

Feb 05, 2024, via arXiv

Investigating Simple Object Representations in Model-Free Deep Reinforcement Learning

Feb 16, 2020, via arXiv

Sequential mastery of multiple tasks: Networks naturally learn to learn

May 28, 2019, via arXiv