Picture for Spencer Kim

Spencer Kim

Reasoning Relay: Evaluating Stability and Interchangeability of Large Language Models in Mathematical Reasoning

Add code
Dec 16, 2025
Viaarxiv icon

WOLF: Werewolf-based Observations for LLM Deception and Falsehoods

Add code
Dec 09, 2025
Viaarxiv icon