Tom Brown

A General Language Assistant as a Laboratory for Alignment

Dec 09, 2021
Amanda Askell, Yuntao Bai, Anna Chen, Dawn Drain, Deep Ganguli, Tom Henighan, Andy Jones, Nicholas Joseph, Ben Mann, Nova DasSarma, Nelson Elhage, Zac Hatfield-Dodds, Danny Hernandez, Jackson Kernion, Kamal Ndousse, Catherine Olsson, Dario Amodei, Tom Brown, Jack Clark, Sam McCandlish, Chris Olah, Jared Kaplan

Extracting Training Data from Large Language Models

Dec 14, 2020
Nicholas Carlini, Florian Tramèr, Eric Wallace, Matthew Jagielski, Ariel Herbert-Voss, Katherine Lee, Adam Roberts, Tom Brown, Dawn Song, Úlfar Erlingsson, Alina Oprea, Colin Raffel

Testing Robustness Against Unforeseen Adversaries

Aug 21, 2019
Daniel Kang, Yi Sun, Dan Hendrycks, Tom Brown, Jacob Steinhardt

Transfer of Adversarial Robustness Between Perturbation Types

May 03, 2019
Daniel Kang, Yi Sun, Tom Brown, Dan Hendrycks, Jacob Steinhardt

Skill Rating for Generative Models

Aug 14, 2018
Catherine Olsson, Surya Bhupatiraju, Tom Brown, Augustus Odena, Ian Goodfellow

Technical Report on the CleverHans v2.1.0 Adversarial Examples Library

Jun 27, 2018
Nicolas Papernot, Fartash Faghri, Nicholas Carlini, Ian Goodfellow, Reuben Feinman, Alexey Kurakin, Cihang Xie, Yash Sharma, Tom Brown, Aurko Roy, Alexander Matyasko, Vahid Behzadan, Karen Hambardzumyan, Zhishuai Zhang, Yi-Lin Juang, Zhi Li, Ryan Sheatsley, Abhibhav Garg, Jonathan Uesato, Willi Gierke, Yinpeng Dong, David Berthelot, Paul Hendricks, Jonas Rauber, Rujun Long, Patrick McDaniel
