Alert button
Picture for Jonathan Stray

Jonathan Stray

Alert button

Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild

Add code
Bookmark button
Alert button
Nov 13, 2023
Nanna Inie, Jonathan Stray, Leon Derczynski

Figure 1 for Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild
Figure 2 for Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild
Figure 3 for Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild
Figure 4 for Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild
Viaarxiv icon

Building Human Values into Recommender Systems: An Interdisciplinary Synthesis

Add code
Bookmark button
Alert button
Jul 20, 2022
Jonathan Stray, Alon Halevy, Parisa Assar, Dylan Hadfield-Menell, Craig Boutilier, Amar Ashar, Lex Beattie, Michael Ekstrand, Claire Leibowicz, Connie Moon Sehat, Sara Johansen, Lianne Kerlin, David Vickrey, Spandana Singh, Sanne Vrijenhoek, Amy Zhang, McKane Andrus, Natali Helberger, Polina Proutskova, Tanushree Mitra, Nina Vasan

Figure 1 for Building Human Values into Recommender Systems: An Interdisciplinary Synthesis
Figure 2 for Building Human Values into Recommender Systems: An Interdisciplinary Synthesis
Figure 3 for Building Human Values into Recommender Systems: An Interdisciplinary Synthesis
Figure 4 for Building Human Values into Recommender Systems: An Interdisciplinary Synthesis
Viaarxiv icon

What are you optimizing for? Aligning Recommender Systems with Human Values

Add code
Bookmark button
Alert button
Jul 22, 2021
Jonathan Stray, Ivan Vendrov, Jeremy Nixon, Steven Adler, Dylan Hadfield-Menell

Viaarxiv icon

Designing Recommender Systems to Depolarize

Add code
Bookmark button
Alert button
Jul 11, 2021
Jonathan Stray

Viaarxiv icon