Picture for Hinrich Schütze

Hinrich Schütze

Shammie

Hateful Person or Hateful Model? Investigating the Role of Personas in Hate Speech Detection by Large Language Models

Add code
Jun 10, 2025
Viaarxiv icon

From Knowledge to Noise: CTIM-Rover and the Pitfalls of Episodic Memory in Software Engineering Agents

Add code
May 29, 2025
Viaarxiv icon

Look Within or Look Beyond? A Theoretical Comparison Between Parameter-Efficient and Full Fine-Tuning

Add code
May 28, 2025
Viaarxiv icon

Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing

Add code
May 27, 2025
Viaarxiv icon

Why Do More Experts Fail? A Theoretical Analysis of Model Merging

Add code
May 27, 2025
Viaarxiv icon

Understanding Gated Neurons in Transformers from Their Input-Output Functionality

Add code
May 23, 2025
Viaarxiv icon

Refusal Direction is Universal Across Safety-Aligned Languages

Add code
May 22, 2025
Viaarxiv icon

Mechanistic Understanding and Mitigation of Language Confusion in English-Centric Large Language Models

Add code
May 22, 2025
Viaarxiv icon

Tracing Multilingual Factual Knowledge Acquisition in Pretraining

Add code
May 20, 2025
Viaarxiv icon

Through a Compressed Lens: Investigating the Impact of Quantization on LLM Explainability and Interpretability

Add code
May 20, 2025
Viaarxiv icon