Craig Swift

Testing Language Model Agents Safely in the Wild

Dec 03, 2023
Silen Naihin, David Atkinson, Marc Green, Merwane Hamadi, Craig Swift, Douglas Schonholtz, Adam Tauman Kalai, David Bau


GAIA: a benchmark for General AI Assistants

Nov 21, 2023
Grégoire Mialon, Clémentine Fourrier, Craig Swift, Thomas Wolf, Yann LeCun, Thomas Scialom
