Picture for Jonathan Stray

Jonathan Stray

Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild

Add code
Nov 13, 2023
Figure 1 for Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild
Figure 2 for Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild
Figure 3 for Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild
Figure 4 for Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild
Viaarxiv icon

Building Human Values into Recommender Systems: An Interdisciplinary Synthesis

Add code
Jul 20, 2022
Figure 1 for Building Human Values into Recommender Systems: An Interdisciplinary Synthesis
Figure 2 for Building Human Values into Recommender Systems: An Interdisciplinary Synthesis
Figure 3 for Building Human Values into Recommender Systems: An Interdisciplinary Synthesis
Figure 4 for Building Human Values into Recommender Systems: An Interdisciplinary Synthesis
Viaarxiv icon

What are you optimizing for? Aligning Recommender Systems with Human Values

Add code
Jul 22, 2021
Viaarxiv icon

Designing Recommender Systems to Depolarize

Add code
Jul 11, 2021
Viaarxiv icon