Alert button
Picture for Leonard Tang

Leonard Tang

Alert button

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Add code
Bookmark button
Alert button
Apr 18, 2024
Bertie Vidgen, Adarsh Agrawal, Ahmed M. Ahmed, Victor Akinwande, Namir Al-Nuaimi, Najla Alfaraj, Elie Alhajjar, Lora Aroyo, Trupti Bavalatti, Borhane Blili-Hamelin, Kurt Bollacker, Rishi Bomassani, Marisa Ferrara Boston, Siméon Campos, Kal Chakra, Canyu Chen, Cody Coleman, Zacharie Delpierre Coudert, Leon Derczynski, Debojyoti Dutta, Ian Eisenberg, James Ezick, Heather Frase, Brian Fuller, Ram Gandikota, Agasthya Gangavarapu, Ananya Gangavarapu, James Gealy, Rajat Ghosh, James Goel, Usman Gohar, Sujata Goswami, Scott A. Hale, Wiebke Hutiri, Joseph Marvin Imperial, Surgan Jandial, Nick Judd, Felix Juefei-Xu, Foutse Khomh, Bhavya Kailkhura, Hannah Rose Kirk, Kevin Klyman, Chris Knotz, Michael Kuchnik, Shachi H. Kumar, Chris Lengerich, Bo Li, Zeyi Liao, Eileen Peters Long, Victor Lu, Yifan Mai, Priyanka Mary Mammen, Kelvin Manyeki, Sean McGregor, Virendra Mehta, Shafee Mohammed, Emanuel Moss, Lama Nachman, Dinesh Jinenhally Naganna, Amin Nikanjam, Besmira Nushi, Luis Oala, Iftach Orr, Alicia Parrish, Cigdem Patlak, William Pietri, Forough Poursabzi-Sangdeh, Eleonora Presani, Fabrizio Puletti, Paul Röttger, Saurav Sahay, Tim Santos, Nino Scherrer, Alice Schoenauer Sebag, Patrick Schramowski, Abolfazl Shahbazi, Vin Sharma, Xudong Shen, Vamsi Sistla, Leonard Tang, Davide Testuggine, Vithursan Thangarasa, Elizabeth Anne Watkins, Rebecca Weiss, Chris Welty, Tyler Wilbers, Adina Williams, Carole-Jean Wu, Poonam Yadav, Xianjun Yang, Yi Zeng, Wenhui Zhang, Fedor Zhdanov, Jiacheng Zhu, Percy Liang, Peter Mattson, Joaquin Vanschoren

Viaarxiv icon

Consistent Explanations in the Face of Model Indeterminacy via Ensembling

Add code
Bookmark button
Alert button
Jun 13, 2023
Dan Ley, Leonard Tang, Matthew Nazari, Hongjin Lin, Suraj Srinivas, Himabindu Lakkaraju

Figure 1 for Consistent Explanations in the Face of Model Indeterminacy via Ensembling
Figure 2 for Consistent Explanations in the Face of Model Indeterminacy via Ensembling
Figure 3 for Consistent Explanations in the Face of Model Indeterminacy via Ensembling
Figure 4 for Consistent Explanations in the Face of Model Indeterminacy via Ensembling
Viaarxiv icon

Degraded Polygons Raise Fundamental Questions of Neural Network Perception

Add code
Bookmark button
Alert button
Jun 08, 2023
Leonard Tang, Dan Ley

Figure 1 for Degraded Polygons Raise Fundamental Questions of Neural Network Perception
Figure 2 for Degraded Polygons Raise Fundamental Questions of Neural Network Perception
Figure 3 for Degraded Polygons Raise Fundamental Questions of Neural Network Perception
Figure 4 for Degraded Polygons Raise Fundamental Questions of Neural Network Perception
Viaarxiv icon

Baselines for Identifying Watermarked Large Language Models

Add code
Bookmark button
Alert button
May 29, 2023
Leonard Tang, Gavin Uberti, Tom Shlomi

Figure 1 for Baselines for Identifying Watermarked Large Language Models
Figure 2 for Baselines for Identifying Watermarked Large Language Models
Figure 3 for Baselines for Identifying Watermarked Large Language Models
Figure 4 for Baselines for Identifying Watermarked Large Language Models
Viaarxiv icon

Learning the Wrong Lessons: Inserting Trojans During Knowledge Distillation

Add code
Bookmark button
Alert button
Mar 09, 2023
Leonard Tang, Tom Shlomi, Alexander Cai

Figure 1 for Learning the Wrong Lessons: Inserting Trojans During Knowledge Distillation
Figure 2 for Learning the Wrong Lessons: Inserting Trojans During Knowledge Distillation
Figure 3 for Learning the Wrong Lessons: Inserting Trojans During Knowledge Distillation
Viaarxiv icon

MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding

Add code
Bookmark button
Alert button
Jan 06, 2023
Steven H. Wang, Antoine Scardigli, Leonard Tang, Wei Chen, Dimitry Levkin, Anya Chen, Spencer Ball, Thomas Woodside, Oliver Zhang, Dan Hendrycks

Figure 1 for MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding
Figure 2 for MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding
Figure 3 for MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding
Figure 4 for MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding
Viaarxiv icon

The Naughtyformer: A Transformer Understands Offensive Humor

Add code
Bookmark button
Alert button
Nov 25, 2022
Leonard Tang, Alexander Cai, Steve Li, Jason Wang

Figure 1 for The Naughtyformer: A Transformer Understands Offensive Humor
Figure 2 for The Naughtyformer: A Transformer Understands Offensive Humor
Figure 3 for The Naughtyformer: A Transformer Understands Offensive Humor
Figure 4 for The Naughtyformer: A Transformer Understands Offensive Humor
Viaarxiv icon

Lila: A Unified Benchmark for Mathematical Reasoning

Add code
Bookmark button
Alert button
Oct 31, 2022
Swaroop Mishra, Matthew Finlayson, Pan Lu, Leonard Tang, Sean Welleck, Chitta Baral, Tanmay Rajpurohit, Oyvind Tafjord, Ashish Sabharwal, Peter Clark, Ashwin Kalyan

Figure 1 for Lila: A Unified Benchmark for Mathematical Reasoning
Figure 2 for Lila: A Unified Benchmark for Mathematical Reasoning
Figure 3 for Lila: A Unified Benchmark for Mathematical Reasoning
Figure 4 for Lila: A Unified Benchmark for Mathematical Reasoning
Viaarxiv icon

A Dataset and Benchmark for Automatically Answering and Generating Machine Learning Final Exams

Add code
Bookmark button
Alert button
Jun 11, 2022
Sarah Zhang, Reece Shuttleworth, Derek Austin, Yann Hicke, Leonard Tang, Sathwik Karnik, Darnell Granberry, Iddo Drori

Figure 1 for A Dataset and Benchmark for Automatically Answering and Generating Machine Learning Final Exams
Figure 2 for A Dataset and Benchmark for Automatically Answering and Generating Machine Learning Final Exams
Figure 3 for A Dataset and Benchmark for Automatically Answering and Generating Machine Learning Final Exams
Figure 4 for A Dataset and Benchmark for Automatically Answering and Generating Machine Learning Final Exams
Viaarxiv icon

A Neural Network Solves and Generates Mathematics Problems by Program Synthesis: Calculus, Differential Equations, Linear Algebra, and More

Add code
Bookmark button
Alert button
Jan 04, 2022
Iddo Drori, Sunny Tran, Roman Wang, Newman Cheng, Kevin Liu, Leonard Tang, Elizabeth Ke, Nikhil Singh, Taylor L. Patti, Jayson Lynch, Avi Shporer, Nakul Verma, Eugene Wu, Gilbert Strang

Figure 1 for A Neural Network Solves and Generates Mathematics Problems by Program Synthesis: Calculus, Differential Equations, Linear Algebra, and More
Figure 2 for A Neural Network Solves and Generates Mathematics Problems by Program Synthesis: Calculus, Differential Equations, Linear Algebra, and More
Figure 3 for A Neural Network Solves and Generates Mathematics Problems by Program Synthesis: Calculus, Differential Equations, Linear Algebra, and More
Figure 4 for A Neural Network Solves and Generates Mathematics Problems by Program Synthesis: Calculus, Differential Equations, Linear Algebra, and More
Viaarxiv icon