Alert button
Picture for Karim Elmaaroufi

Karim Elmaaroufi

Alert button

$L^*LM$: Learning Automata from Examples using Natural Language Oracles

Add code
Bookmark button
Alert button
Feb 10, 2024
Marcell Vazquez-Chanlatte, Karim Elmaaroufi, Stefan J. Witwicki, Sanjit A. Seshia

Viaarxiv icon

Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game

Add code
Bookmark button
Alert button
Nov 02, 2023
Sam Toyer, Olivia Watkins, Ethan Adrian Mendes, Justin Svegliato, Luke Bailey, Tiffany Wang, Isaac Ong, Karim Elmaaroufi, Pieter Abbeel, Trevor Darrell, Alan Ritter, Stuart Russell

Viaarxiv icon