Alexander von Recum

Are Reasoning LLMs Robust to Interventions on Their Chain-of-Thought?

Feb 07, 2026
Via arXiv

Cannot or Should Not? Automatic Analysis of Refusal Composition in IFT/RLHF Datasets and Refusal Behavior of Black-Box LLMs

Dec 22, 2024
Via arXiv