Picture for Christoforos Nalmpantis

Christoforos Nalmpantis

Teaching Large Language Models to Reason with Reinforcement Learning

Add code
Mar 07, 2024
Figure 1 for Teaching Large Language Models to Reason with Reinforcement Learning
Figure 2 for Teaching Large Language Models to Reason with Reinforcement Learning
Figure 3 for Teaching Large Language Models to Reason with Reinforcement Learning
Figure 4 for Teaching Large Language Models to Reason with Reinforcement Learning
Viaarxiv icon

Understanding the Effects of RLHF on LLM Generalisation and Diversity

Add code
Oct 10, 2023
Figure 1 for Understanding the Effects of RLHF on LLM Generalisation and Diversity
Figure 2 for Understanding the Effects of RLHF on LLM Generalisation and Diversity
Figure 3 for Understanding the Effects of RLHF on LLM Generalisation and Diversity
Figure 4 for Understanding the Effects of RLHF on LLM Generalisation and Diversity
Viaarxiv icon

Neurons in Large Language Models: Dead, N-gram, Positional

Add code
Sep 09, 2023
Figure 1 for Neurons in Large Language Models: Dead, N-gram, Positional
Figure 2 for Neurons in Large Language Models: Dead, N-gram, Positional
Figure 3 for Neurons in Large Language Models: Dead, N-gram, Positional
Figure 4 for Neurons in Large Language Models: Dead, N-gram, Positional
Viaarxiv icon

Augmented Language Models: a Survey

Add code
Feb 15, 2023
Figure 1 for Augmented Language Models: a Survey
Figure 2 for Augmented Language Models: a Survey
Figure 3 for Augmented Language Models: a Survey
Figure 4 for Augmented Language Models: a Survey
Viaarxiv icon

PEER: A Collaborative Language Model

Add code
Aug 24, 2022
Figure 1 for PEER: A Collaborative Language Model
Figure 2 for PEER: A Collaborative Language Model
Figure 3 for PEER: A Collaborative Language Model
Figure 4 for PEER: A Collaborative Language Model
Viaarxiv icon