Picture for Lewis Hammond

Lewis Hammond

IDs for AI Systems

Add code
Jun 17, 2024
Viaarxiv icon

Foundational Challenges in Assuring Alignment and Safety of Large Language Models

Add code
Apr 15, 2024
Viaarxiv icon

Cooperation and Control in Delegation Games

Add code
Feb 24, 2024
Viaarxiv icon

Secret Collusion Among Generative AI Agents

Add code
Feb 12, 2024
Figure 1 for Secret Collusion Among Generative AI Agents
Figure 2 for Secret Collusion Among Generative AI Agents
Figure 3 for Secret Collusion Among Generative AI Agents
Figure 4 for Secret Collusion Among Generative AI Agents
Viaarxiv icon

Visibility into AI Agents

Add code
Feb 04, 2024
Viaarxiv icon

Welfare Diplomacy: Benchmarking Language Model Cooperation

Add code
Oct 13, 2023
Figure 1 for Welfare Diplomacy: Benchmarking Language Model Cooperation
Figure 2 for Welfare Diplomacy: Benchmarking Language Model Cooperation
Figure 3 for Welfare Diplomacy: Benchmarking Language Model Cooperation
Figure 4 for Welfare Diplomacy: Benchmarking Language Model Cooperation
Viaarxiv icon

On Imperfect Recall in Multi-Agent Influence Diagrams

Add code
Jul 11, 2023
Figure 1 for On Imperfect Recall in Multi-Agent Influence Diagrams
Figure 2 for On Imperfect Recall in Multi-Agent Influence Diagrams
Figure 3 for On Imperfect Recall in Multi-Agent Influence Diagrams
Figure 4 for On Imperfect Recall in Multi-Agent Influence Diagrams
Viaarxiv icon

Reasoning about Causality in Games

Add code
Jan 05, 2023
Figure 1 for Reasoning about Causality in Games
Figure 2 for Reasoning about Causality in Games
Figure 3 for Reasoning about Causality in Games
Figure 4 for Reasoning about Causality in Games
Viaarxiv icon

Lexicographic Multi-Objective Reinforcement Learning

Add code
Dec 28, 2022
Figure 1 for Lexicographic Multi-Objective Reinforcement Learning
Figure 2 for Lexicographic Multi-Objective Reinforcement Learning
Viaarxiv icon

Observational Robustness and Invariances in Reinforcement Learning via Lexicographic Objectives

Add code
Sep 30, 2022
Figure 1 for Observational Robustness and Invariances in Reinforcement Learning via Lexicographic Objectives
Figure 2 for Observational Robustness and Invariances in Reinforcement Learning via Lexicographic Objectives
Figure 3 for Observational Robustness and Invariances in Reinforcement Learning via Lexicographic Objectives
Figure 4 for Observational Robustness and Invariances in Reinforcement Learning via Lexicographic Objectives
Viaarxiv icon