Joschka Braun

Beyond Multiple Choice: Evaluating Steering Vectors for Adaptive Free-Form Summarization

May 30, 2025
Via arXiv

Understanding (Un)Reliability of Steering Vectors in Language Models

May 28, 2025
Via arXiv