
Lucy Campbell-Gillingham

Fine-tuning language models to find agreement among humans with diverse preferences

Nov 28, 2022
Michiel A. Bakker, Martin J. Chadwick, Hannah R. Sheahan, Michael Henry Tessler, Lucy Campbell-Gillingham, Jan Balaguer, Nat McAleese, Amelia Glaese, John Aslanides, Matthew M. Botvinick, Christopher Summerfield

Via arXiv

Improving alignment of dialogue agents via targeted human judgements

Sep 28, 2022
Amelia Glaese, Nat McAleese, Maja Trębacz, John Aslanides, Vlad Firoiu, Timo Ewalds, Maribeth Rauh, Laura Weidinger, Martin Chadwick, Phoebe Thacker, Lucy Campbell-Gillingham, Jonathan Uesato, Po-Sen Huang, Ramona Comanescu, Fan Yang, Abigail See, Sumanth Dathathri, Rory Greig, Charlie Chen, Doug Fritz, Jaume Sanchez Elias, Richard Green, Soňa Mokrá, Nicholas Fernando, Boxi Wu, Rachel Foley, Susannah Young, Iason Gabriel, William Isaac, John Mellor, Demis Hassabis, Koray Kavukcuoglu, Lisa Anne Hendricks, Geoffrey Irving

Via arXiv

Teaching language models to support answers with verified quotes

Mar 21, 2022
Jacob Menick, Maja Trebacz, Vladimir Mikulik, John Aslanides, Francis Song, Martin Chadwick, Mia Glaese, Susannah Young, Lucy Campbell-Gillingham, Geoffrey Irving, Nat McAleese

Via arXiv

HCMD-zero: Learning Value Aligned Mechanisms from Data

Feb 21, 2022
Jan Balaguer, Raphael Koster, Ari Weinstein, Lucy Campbell-Gillingham, Christopher Summerfield, Matthew Botvinick, Andrea Tacchetti

Via arXiv

Human-centered mechanism design with Democratic AI

Jan 27, 2022
Raphael Koster, Jan Balaguer, Andrea Tacchetti, Ari Weinstein, Tina Zhu, Oliver Hauser, Duncan Williams, Lucy Campbell-Gillingham, Phoebe Thacker, Matthew Botvinick, Christopher Summerfield

Via arXiv