Sarah Ball

On the Impossibility of Separating Intelligence from Judgment: The Computational Intractability of Filtering for AI Alignment

Jul 09, 2025
Via arXiv

Human Preferences in Large Language Model Latent Space: A Technical Analysis on the Reliability of Synthetic Data in Voting Outcome Prediction

Feb 22, 2025
Via arXiv

Understanding Jailbreak Success: A Study of Latent Space Dynamics in Large Language Models

Jun 13, 2024
Via arXiv

Seeing ChatGPT Through Students' Eyes: An Analysis of TikTok Data

Mar 09, 2023
Via arXiv