Alert button
Picture for Christoforos Nalmpantis

Christoforos Nalmpantis

Alert button

Teaching Large Language Models to Reason with Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 07, 2024
Alex Havrilla, Yuqing Du, Sharath Chandra Raparthy, Christoforos Nalmpantis, Jane Dwivedi-Yu, Maksym Zhuravinskyi, Eric Hambro, Sainbayar Sukhbaatar, Roberta Raileanu

Figure 1 for Teaching Large Language Models to Reason with Reinforcement Learning
Figure 2 for Teaching Large Language Models to Reason with Reinforcement Learning
Figure 3 for Teaching Large Language Models to Reason with Reinforcement Learning
Figure 4 for Teaching Large Language Models to Reason with Reinforcement Learning
Viaarxiv icon

Understanding the Effects of RLHF on LLM Generalisation and Diversity

Add code
Bookmark button
Alert button
Oct 10, 2023
Robert Kirk, Ishita Mediratta, Christoforos Nalmpantis, Jelena Luketina, Eric Hambro, Edward Grefenstette, Roberta Raileanu

Viaarxiv icon

Neurons in Large Language Models: Dead, N-gram, Positional

Add code
Bookmark button
Alert button
Sep 09, 2023
Elena Voita, Javier Ferrando, Christoforos Nalmpantis

Figure 1 for Neurons in Large Language Models: Dead, N-gram, Positional
Figure 2 for Neurons in Large Language Models: Dead, N-gram, Positional
Figure 3 for Neurons in Large Language Models: Dead, N-gram, Positional
Figure 4 for Neurons in Large Language Models: Dead, N-gram, Positional
Viaarxiv icon

Augmented Language Models: a Survey

Add code
Bookmark button
Alert button
Feb 15, 2023
Grégoire Mialon, Roberto Dessì, Maria Lomeli, Christoforos Nalmpantis, Ram Pasunuru, Roberta Raileanu, Baptiste Rozière, Timo Schick, Jane Dwivedi-Yu, Asli Celikyilmaz, Edouard Grave, Yann LeCun, Thomas Scialom

Figure 1 for Augmented Language Models: a Survey
Figure 2 for Augmented Language Models: a Survey
Figure 3 for Augmented Language Models: a Survey
Figure 4 for Augmented Language Models: a Survey
Viaarxiv icon

PEER: A Collaborative Language Model

Add code
Bookmark button
Alert button
Aug 24, 2022
Timo Schick, Jane Dwivedi-Yu, Zhengbao Jiang, Fabio Petroni, Patrick Lewis, Gautier Izacard, Qingfei You, Christoforos Nalmpantis, Edouard Grave, Sebastian Riedel

Figure 1 for PEER: A Collaborative Language Model
Figure 2 for PEER: A Collaborative Language Model
Figure 3 for PEER: A Collaborative Language Model
Figure 4 for PEER: A Collaborative Language Model
Viaarxiv icon