Picture for Leon Eshuijs

Leon Eshuijs

But what is your honest answer? Aiding LLM-judges with honest alternatives using steering vectors

Add code
May 23, 2025
Viaarxiv icon

Short-circuiting Shortcuts: Mechanistic Investigation of Shortcuts in Text Classification

Add code
May 09, 2025
Viaarxiv icon

Balancing the Scales: Reinforcement Learning for Fair Classification

Add code
Jul 15, 2024
Figure 1 for Balancing the Scales: Reinforcement Learning for Fair Classification
Figure 2 for Balancing the Scales: Reinforcement Learning for Fair Classification
Figure 3 for Balancing the Scales: Reinforcement Learning for Fair Classification
Figure 4 for Balancing the Scales: Reinforcement Learning for Fair Classification
Viaarxiv icon