Picture for Antske Fokkens

Antske Fokkens

Improving Causal Interventions in Amnesic Probing with Mean Projection or LEACE

Add code
Jun 13, 2025
Viaarxiv icon

Short-circuiting Shortcuts: Mechanistic Investigation of Shortcuts in Text Classification

Add code
May 09, 2025
Viaarxiv icon

DefVerify: Do Hate Speech Models Reflect Their Dataset's Definition?

Add code
Oct 21, 2024
Viaarxiv icon

Crowd-Calibrator: Can Annotator Disagreement Inform Calibration in Subjective Tasks?

Add code
Aug 26, 2024
Viaarxiv icon

Balancing the Scales: Reinforcement Learning for Fair Classification

Add code
Jul 15, 2024
Viaarxiv icon

ARM: Efficient Guided Decoding with Autoregressive Reward Models

Add code
Jul 05, 2024
Viaarxiv icon

Investigating the Robustness of Modelling Decisions for Few-Shot Cross-Topic Stance Detection: A Preregistered Study

Add code
Apr 05, 2024
Viaarxiv icon

The Role of Syntactic Span Preferences in Post-Hoc Explanation Disagreement

Add code
Mar 28, 2024
Viaarxiv icon

Dynamic Top-k Estimation Consolidates Disagreement between Feature Attribution Methods

Add code
Oct 09, 2023
Viaarxiv icon

Improving and Evaluating the Detection of Fragmentation in News Recommendations with the Clustering of News Story Chains

Add code
Sep 18, 2023
Figure 1 for Improving and Evaluating the Detection of Fragmentation in News Recommendations with the Clustering of News Story Chains
Figure 2 for Improving and Evaluating the Detection of Fragmentation in News Recommendations with the Clustering of News Story Chains
Figure 3 for Improving and Evaluating the Detection of Fragmentation in News Recommendations with the Clustering of News Story Chains
Viaarxiv icon