Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zach Studdiford

Reasoning as Pattern Matching: Shared Mechanisms in Human and LLM Everyday Reasoning

Jun 11, 2026

Zach Studdiford, Gary Lupyan

Abstract:When large language models (LLMs) fail to generalize or make haphazard errors in reasoning, it is often taken as evidence that LLMs are not truly reasoning, but rather performing a kind of pattern matching. The implication is that people's behavior does not exhibit the same types of failures because human reasoning uses principled and abstract world models. We evaluate human participants and 25 LLMs on their ability to engage in common-sense reasoning about a variety of everyday situations and observe similar patterns of errors in both people and models. We then identify the set of attention heads driving LLM responses and find that these heads implement a form of pattern-matching. These attention heads allow us to predict seemingly inexplicable reasoning errors in people caused by ostensibly irrelevant prompt details. Taken together, our results suggest that everyday causal reasoning in people and LLMs is more consistent with a form of pattern-matching than with abstract world models.

* 13 pages main text, 51 pages supplementary text

Via

Access Paper or Ask Questions

Evaluating Steering Techniques using Human Similarity Judgments

May 25, 2025

Zach Studdiford, Timothy T. Rogers, Siddharth Suresh, Kushin Mukherjee

Figure 1 for Evaluating Steering Techniques using Human Similarity Judgments

Figure 2 for Evaluating Steering Techniques using Human Similarity Judgments

Figure 3 for Evaluating Steering Techniques using Human Similarity Judgments

Figure 4 for Evaluating Steering Techniques using Human Similarity Judgments

Abstract:Current evaluations of Large Language Model (LLM) steering techniques focus on task-specific performance, overlooking how well steered representations align with human cognition. Using a well-established triadic similarity judgment task, we assessed steered LLMs on their ability to flexibly judge similarity between concepts based on size or kind. We found that prompt-based steering methods outperformed other methods both in terms of steering accuracy and model-to-human alignment. We also found LLMs were biased towards 'kind' similarity and struggled with 'size' alignment. This evaluation approach, grounded in human cognition, adds further support to the efficacy of prompt-based steering and reveals privileged representational axes in LLMs prior to steering.

Via

Access Paper or Ask Questions

Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks

Jun 25, 2024

Yun-Shiuan Chuang, Zach Studdiford, Krirk Nirunwiroj, Agam Goyal, Vincent V. Frigo, Sijia Yang, Dhavan Shah, Junjie Hu, Timothy T. Rogers

Figure 1 for Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks

Figure 2 for Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks

Figure 3 for Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks

Figure 4 for Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks

Abstract:Creating human-like large language model (LLM) agents is crucial for faithful social simulation. Having LLMs role-play based on demographic information sometimes improves human likeness but often does not. This study assessed whether LLM alignment with human behavior can be improved by integrating information from empirically-derived human belief networks. Using data from a human survey, we estimated a belief network encompassing 18 topics loading on two non-overlapping latent factors. We then seeded LLM-based agents with an opinion on one topic, and assessed the alignment of its expressed opinions on remaining test topics with corresponding human data. Role-playing based on demographic information alone did not align LLM and human opinions, but seeding the agent with a single belief greatly improved alignment for topics related in the belief network, and not for topics outside the network. These results suggest a novel path for human-LLM belief alignment in work seeking to simulate and understand patterns of belief distributions in society.

Via

Access Paper or Ask Questions