Arushi Somani

Unsupervised Elicitation of Language Models

Jun 11, 2025
Via arXiv

Reasoning Models Don't Always Say What They Think

May 08, 2025
Via arXiv

Values in the Wild: Discovering and Analyzing Values in Real-World Language Model Interactions

Apr 21, 2025
Via arXiv