Alert button
Picture for Russell Kaplan

Russell Kaplan

Alert button

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

Add code
Bookmark button
Alert button
Mar 06, 2024
Nathaniel Li, Alexander Pan, Anjali Gopal, Summer Yue, Daniel Berrios, Alice Gatti, Justin D. Li, Ann-Kathrin Dombrowski, Shashwat Goel, Long Phan, Gabriel Mukobi, Nathan Helm-Burger, Rassin Lababidi, Lennart Justen, Andrew B. Liu, Michael Chen, Isabelle Barrass, Oliver Zhang, Xiaoyuan Zhu, Rishub Tamirisa, Bhrugu Bharathi, Adam Khoja, Zhenqi Zhao, Ariel Herbert-Voss, Cort B. Breuer, Andy Zou, Mantas Mazeika, Zifan Wang, Palash Oswal, Weiran Liu, Adam A. Hunt, Justin Tienken-Harder, Kevin Y. Shih, Kemper Talley, John Guan, Russell Kaplan, Ian Steneker, David Campbell, Brad Jokubaitis, Alex Levinson, Jean Wang, William Qian, Kallol Krishna Karmakar, Steven Basart, Stephen Fitz, Mindy Levine, Ponnurangam Kumaraguru, Uday Tupakula, Vijay Varadharajan, Yan Shoshitaishvili, Jimmy Ba, Kevin M. Esvelt, Alexandr Wang, Dan Hendrycks

Figure 1 for The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Figure 2 for The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Figure 3 for The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Figure 4 for The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Viaarxiv icon

Empirical Analysis of the Strengths and Weaknesses of PEFT Techniques for LLMs

Add code
Bookmark button
Alert button
Apr 28, 2023
George Pu, Anirudh Jain, Jihan Yin, Russell Kaplan

Figure 1 for Empirical Analysis of the Strengths and Weaknesses of PEFT Techniques for LLMs
Figure 2 for Empirical Analysis of the Strengths and Weaknesses of PEFT Techniques for LLMs
Figure 3 for Empirical Analysis of the Strengths and Weaknesses of PEFT Techniques for LLMs
Viaarxiv icon

HiDDeN: Hiding Data With Deep Networks

Add code
Bookmark button
Alert button
Jul 26, 2018
Jiren Zhu, Russell Kaplan, Justin Johnson, Li Fei-Fei

Figure 1 for HiDDeN: Hiding Data With Deep Networks
Figure 2 for HiDDeN: Hiding Data With Deep Networks
Figure 3 for HiDDeN: Hiding Data With Deep Networks
Figure 4 for HiDDeN: Hiding Data With Deep Networks
Viaarxiv icon

Beating Atari with Natural Language Guided Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 18, 2017
Russell Kaplan, Christopher Sauer, Alexander Sosa

Figure 1 for Beating Atari with Natural Language Guided Reinforcement Learning
Figure 2 for Beating Atari with Natural Language Guided Reinforcement Learning
Figure 3 for Beating Atari with Natural Language Guided Reinforcement Learning
Figure 4 for Beating Atari with Natural Language Guided Reinforcement Learning
Viaarxiv icon