Picture for Eric Hambro

Eric Hambro

Know When To Stop: A Study of Semantic Drift in Text Generation

Add code
Apr 08, 2024
Viaarxiv icon

Teaching Large Language Models to Reason with Reinforcement Learning

Add code
Mar 07, 2024
Figure 1 for Teaching Large Language Models to Reason with Reinforcement Learning
Figure 2 for Teaching Large Language Models to Reason with Reinforcement Learning
Figure 3 for Teaching Large Language Models to Reason with Reinforcement Learning
Figure 4 for Teaching Large Language Models to Reason with Reinforcement Learning
Viaarxiv icon

Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts

Add code
Feb 26, 2024
Figure 1 for Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts
Figure 2 for Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts
Figure 3 for Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts
Figure 4 for Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts
Viaarxiv icon

GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements

Add code
Feb 13, 2024
Figure 1 for GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements
Figure 2 for GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements
Figure 3 for GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements
Figure 4 for GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements
Viaarxiv icon

Generalization to New Sequential Decision Making Tasks with In-Context Learning

Add code
Dec 06, 2023
Viaarxiv icon

Understanding the Effects of RLHF on LLM Generalisation and Diversity

Add code
Oct 10, 2023
Figure 1 for Understanding the Effects of RLHF on LLM Generalisation and Diversity
Figure 2 for Understanding the Effects of RLHF on LLM Generalisation and Diversity
Figure 3 for Understanding the Effects of RLHF on LLM Generalisation and Diversity
Figure 4 for Understanding the Effects of RLHF on LLM Generalisation and Diversity
Viaarxiv icon

LLaMA: Open and Efficient Foundation Language Models

Add code
Feb 27, 2023
Figure 1 for LLaMA: Open and Efficient Foundation Language Models
Figure 2 for LLaMA: Open and Efficient Foundation Language Models
Figure 3 for LLaMA: Open and Efficient Foundation Language Models
Figure 4 for LLaMA: Open and Efficient Foundation Language Models
Viaarxiv icon

Dungeons and Data: A Large-Scale NetHack Dataset

Add code
Nov 22, 2022
Figure 1 for Dungeons and Data: A Large-Scale NetHack Dataset
Figure 2 for Dungeons and Data: A Large-Scale NetHack Dataset
Figure 3 for Dungeons and Data: A Large-Scale NetHack Dataset
Figure 4 for Dungeons and Data: A Large-Scale NetHack Dataset
Viaarxiv icon

Insights From the NeurIPS 2021 NetHack Challenge

Add code
Mar 22, 2022
Figure 1 for Insights From the NeurIPS 2021 NetHack Challenge
Figure 2 for Insights From the NeurIPS 2021 NetHack Challenge
Figure 3 for Insights From the NeurIPS 2021 NetHack Challenge
Figure 4 for Insights From the NeurIPS 2021 NetHack Challenge
Viaarxiv icon

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

Add code
Sep 27, 2021
Figure 1 for MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Figure 2 for MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Figure 3 for MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Figure 4 for MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Viaarxiv icon