Robert Kirk

Analyzing the Generalization and Reliability of Steering Vectors -- ICML 2024

Jul 17, 2024

Leading the Pack: N-player Opponent Shaping

Dec 26, 2023

Generalization to New Sequential Decision Making Tasks with In-Context Learning

Dec 06, 2023

Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks

Nov 21, 2023

Understanding the Effects of RLHF on LLM Generalisation and Diversity

Oct 10, 2023

Reward Model Ensembles Help Mitigate Overoptimization

Oct 04, 2023

Domain Generalization for Robust Model-Based Offline Reinforcement Learning

Nov 27, 2022

Graph Backup: Data Efficient Backup Exploiting Markovian Transitions

May 31, 2022

Insights From the NeurIPS 2021 NetHack Challenge

Mar 22, 2022

A Survey of Generalisation in Deep Reinforcement Learning

Nov 18, 2021