Picture for Jelena Luketina

Jelena Luketina

Understanding the Effects of RLHF on LLM Generalisation and Diversity

Add code
Oct 10, 2023
Figure 1 for Understanding the Effects of RLHF on LLM Generalisation and Diversity
Figure 2 for Understanding the Effects of RLHF on LLM Generalisation and Diversity
Figure 3 for Understanding the Effects of RLHF on LLM Generalisation and Diversity
Figure 4 for Understanding the Effects of RLHF on LLM Generalisation and Diversity
Viaarxiv icon

Meta-Gradients in Non-Stationary Environments

Add code
Sep 13, 2022
Figure 1 for Meta-Gradients in Non-Stationary Environments
Figure 2 for Meta-Gradients in Non-Stationary Environments
Figure 3 for Meta-Gradients in Non-Stationary Environments
Figure 4 for Meta-Gradients in Non-Stationary Environments
Viaarxiv icon

WordCraft: An Environment for Benchmarking Commonsense Agents

Add code
Jul 17, 2020
Figure 1 for WordCraft: An Environment for Benchmarking Commonsense Agents
Figure 2 for WordCraft: An Environment for Benchmarking Commonsense Agents
Figure 3 for WordCraft: An Environment for Benchmarking Commonsense Agents
Figure 4 for WordCraft: An Environment for Benchmarking Commonsense Agents
Viaarxiv icon

The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning

Add code
Jun 16, 2020
Figure 1 for The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning
Figure 2 for The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning
Figure 3 for The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning
Figure 4 for The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning
Viaarxiv icon

A Survey of Reinforcement Learning Informed by Natural Language

Add code
Jun 10, 2019
Figure 1 for A Survey of Reinforcement Learning Informed by Natural Language
Viaarxiv icon

Progress & Compress: A scalable framework for continual learning

Add code
Jul 02, 2018
Figure 1 for Progress & Compress: A scalable framework for continual learning
Figure 2 for Progress & Compress: A scalable framework for continual learning
Figure 3 for Progress & Compress: A scalable framework for continual learning
Figure 4 for Progress & Compress: A scalable framework for continual learning
Viaarxiv icon

Scalable Gradient-Based Tuning of Continuous Regularization Hyperparameters

Add code
Jun 17, 2016
Figure 1 for Scalable Gradient-Based Tuning of Continuous Regularization Hyperparameters
Figure 2 for Scalable Gradient-Based Tuning of Continuous Regularization Hyperparameters
Figure 3 for Scalable Gradient-Based Tuning of Continuous Regularization Hyperparameters
Figure 4 for Scalable Gradient-Based Tuning of Continuous Regularization Hyperparameters
Viaarxiv icon