Brian Scassellati

The Attentional White Bear Effect in Transformer Language Models

May 27, 2026

Rebecca Ramnauth, Brian Scassellati

Abstract:Instruction-based suppression is widely used to prevent language models from generating prohibited content, yet it remains unclear whether suppression reduces internal representation or merely suppresses expression. We investigate this question through representational probing, attention analysis, and behavioral semantic leakage experiments across multiple transformer models. We find that prohibited concepts remain highly recoverable from hidden representations under suppression, continue to influence attention routing, and measurably shape downstream generations despite successful lexical avoidance. These effects persist across pooling strategies, indirect semantic controls, and multiple model families. Our results expose a fundamental gap between behavioral and representational alignment.

* Currently under review at EMNLP 2026

Robotics-Inspired Guardrails for Foundation Models in Socially Sensitive Domains

May 19, 2026

Rebecca Ramnauth, Dražen Brščić, Brian Scassellati

Abstract:Foundation models are increasingly deployed in socially sensitive domains such as education, mental health, and caregiving, where failures are often cumulative and context-dependent. Existing guardrail approaches -- ranging from training-time alignment to prompting, decoding constraints, and post-hoc moderation -- primarily provide empirical risk reduction rather than enforceable behavioral guarantees, and largely treat safety as a property of individual outputs rather than interaction trajectories. We reframe guardrails as a problem of runtime behavioral control over interaction trajectories, drawing on robotics to introduce formal constructs for constraint enforcement in uncertain, closed-loop systems. We instantiate these ideas in the Grounded Observer framework and apply it across three real-world deployments: small talk, in-home autism therapy, and behavioral de-escalation in schools. Across settings, the framework enables runtime interventions that mitigate drift into undesirable interaction regimes while adapting to diverse social contexts. We discuss extensions to the framework and propose research directions toward stronger guarantees.

* Under review at Journal of Artificial Intelligence Research (JAIR)

Towards Zero-Knowledge Task Planning via a Language-based Approach

Jan 06, 2026

Liam Merz Hoffmeister, Brian Scassellati, Daniel Rakita

Abstract:In this work, we introduce and formalize the Zero-Knowledge Task Planning (ZKTP) problem, i.e., formulating a sequence of actions to achieve some goal without task-specific knowledge. Additionally, we present a first investigation and approach for ZKTP that leverages a large language model (LLM) to decompose natural language instructions into subtasks and generate behavior trees (BTs) for execution. If errors arise during task execution, the approach also uses an LLM to adjust the BTs on-the-fly in a refinement loop. Experimental validation in the AI2-THOR simulator demonstrates our approach's effectiveness in improving overall task performance compared to alternative approaches that leverage task-specific knowledge. Our work demonstrates the potential of LLMs to effectively address several aspects of the ZKTP problem, providing a robust framework for automated behavior generation with no task-specific setup.

A Robot-Assisted Approach to Small Talk Training for Adults with ASD

May 29, 2025

Rebecca Ramnauth, Dražen Brščić, Brian Scassellati

Abstract:From dating to job interviews, making new friends or simply chatting with the cashier at checkout, engaging in small talk is a vital, everyday social skill. For adults with Autism Spectrum Disorder (ASD), small talk can be particularly challenging, yet it is essential for social integration, building relationships, and accessing professional opportunities. In this study, we present our development and evaluation of an in-home autonomous robot system that allows users to practice small talk. Results from the week-long study show that adults with ASD enjoyed the training, made notable progress in initiating conversations and improving eye contact, and viewed the system as a valuable tool for enhancing their conversational skills.

* Accepted for publication in Robotics: Science and Systems (RSS) 2025, 14 pages, 4 figures

Effects of Robot Competency and Motion Legibility on Human Correction Feedback

Jan 07, 2025

Shuangge Wang, Anjiabei Wang, Sofiya Goncharova, Brian Scassellati, Tesca Fitzgerald

Abstract:As robot deployments become more commonplace, people are likely to take on the role of supervising robots (i.e., correcting their mistakes) rather than directly teaching them. Prior works on Learning from Corrections (LfC) have relied on three key assumptions to interpret human feedback: (1) people correct the robot only when there is significant task objective divergence; (2) people can accurately predict if a correction is necessary; and (3) people trade off precision and physical effort when giving corrections. In this work, we study how two key factors (robot competency and motion legibility) affect how people provide correction feedback and their implications on these existing assumptions. We conduct a user study ($N=60$) under an LfC setting where participants supervise and correct a robot performing pick-and-place tasks. We find that people are more sensitive to suboptimal behavior by a highly competent robot compared to an incompetent robot when the motions are legible ($p=0.0015$) and predictable ($p=0.0055$). In addition, people also tend to withhold necessary corrections ($p < 0.0001$) when supervising an incompetent robot and are more prone to offering unnecessary ones ($p = 0.0171$) when supervising a highly competent robot. We also find that physical effort positively correlates with correction precision, providing empirical evidence to support this common assumption. We also find that this correlation is significantly weaker for an incompetent robot with legible motions than an incompetent robot with predictable motions ($p = 0.0075$). Our findings offer insights for accounting for competency and legibility when designing robot interaction behaviors and learning task objectives from corrections.

* to be published in the 2025 ACM/IEEE International Conference on Human-Robot Interaction (HRI)

Gaze Behavior During a Long-Term, In-Home, Social Robot Intervention for Children with ASD

Jan 05, 2025

Rebecca Ramnauth, Frederick Shic, Brian Scassellati

Abstract:Atypical gaze behavior is a diagnostic hallmark of Autism Spectrum Disorder (ASD), playing a substantial role in the social and communicative challenges that individuals with ASD face. This study explores the impacts of a month-long, in-home intervention designed to promote triadic interactions between a social robot, a child with ASD, and their caregiver. Our results indicate that the intervention successfully promoted appropriate gaze behavior, encouraging children with ASD to follow the robot's gaze, resulting in more frequent and prolonged instances of spontaneous eye contact and joint attention with their caregivers. Additionally, we observed specific timelines for behavioral variability and novelty effects among users. Furthermore, diagnostic measures for ASD emerged as strong predictors of gaze patterns for both caregivers and children. These results deepen our understanding of ASD gaze patterns and highlight the potential for clinical relevance of robot-assisted interventions.

* Accepted for publication at the 2025 20th IEEE/ACM International Conference on Human-Robot Interaction (HRI)

More than Chit-Chat: Developing Robots for Small-Talk Interactions

Dec 23, 2024

Rebecca Ramnauth, Dražen Brščić, Brian Scassellati

Abstract:Beyond mere formality, small talk plays a pivotal role in social dynamics, serving as a verbal handshake for building rapport and understanding. For conversational AI and social robots, the ability to engage in small talk enhances their perceived sociability, leading to more comfortable and natural user interactions. In this study, we evaluate the capacity of current Large Language Models (LLMs) to drive the small talk of a social robot and identify key areas for improvement. We introduce a novel method that autonomously generates feedback and ensures LLM-generated responses align with small talk conventions. Through several evaluations -- involving chatbot interactions and human-robot interactions -- we demonstrate the system's effectiveness in guiding LLM-generated responses toward realistic, human-like, and natural small-talk exchanges.

A Grounded Observer Framework for Establishing Guardrails for Foundation Models in Socially Sensitive Domains

Dec 23, 2024

Rebecca Ramnauth, Dražen Brščić, Brian Scassellati

Abstract:As foundation models increasingly permeate sensitive domains such as healthcare, finance, and mental health, ensuring their behavior meets desired outcomes and social expectations becomes critical. Given the complexities of these high-dimensional models, traditional techniques for constraining agent behavior, which typically rely on low-dimensional, discrete state and action spaces, cannot be directly applied. Drawing inspiration from robotic action selection techniques, we propose the grounded observer framework for constraining foundation model behavior that offers both behavioral guarantees and real-time variability. This method leverages real-time assessment of low-level behavioral characteristics to dynamically adjust model actions and provide contextual feedback. To demonstrate this, we develop a system capable of sustaining contextually appropriate, casual conversations ("small talk"), which we then apply to a robot for novel, unscripted interactions with humans. Finally, we discuss potential applications of the framework for other social contexts and areas for further research.

* arXiv admin note: text overlap with arXiv:2412.18023

Sequential Discrete Action Selection via Blocking Conditions and Resolutions

Sep 12, 2024

Liam Merz Hoffmeister, Brian Scassellati, Daniel Rakita

Abstract:In this work, we introduce a strategy that frames the sequential action selection problem for robots in terms of resolving "blocking conditions", i.e., situations that impede progress on an action en route to a goal. This strategy allows a robot to make one-at-a-time decisions that take in pertinent contextual information and swiftly adapt and react to current situations. We present a first instantiation of this strategy that combines a state-transition graph and a zero-shot Large Language Model (LLM). The state-transition graph tracks which previously attempted actions are currently blocked and which candidate actions may resolve existing blocking conditions. This information from the state-transition graph is used to automatically generate a prompt for the LLM, which then uses the given context and set of possible actions to select a single action to try next. This selection process is iterative, with each chosen and executed action further refining the state-transition graph, continuing until the agent either fulfills the goal or encounters a termination condition. We demonstrate the effectiveness of our approach by comparing it to various LLM and traditional task-planning methods in a testbed of simulation experiments. We discuss the implications of our work based on our results.
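
The loop described in this abstract can be illustrated with a small sketch. This is not the authors' implementation: the toy door-and-key domain, the `BlockingGraph` class, the `attempt` and `run` functions, and the `first_option` selector (a placeholder where a zero-shot LLM call would go) are all hypothetical names chosen for illustration. The sketch only assumes what the abstract states: blocked actions are tracked, candidate actions that may resolve open blocking conditions are offered, one action is selected per step, and the loop ends at the goal or at a termination condition (here, a step limit).

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Set, Tuple

# Hypothetical toy domain: the condition each action needs, and the condition it establishes.
PRECONDITION: Dict[str, Optional[str]] = {
    "pick_up_key": None,
    "unlock_door": "has_key",
    "open_door": "door_unlocked",
}
EFFECT: Dict[str, str] = {
    "pick_up_key": "has_key",
    "unlock_door": "door_unlocked",
    "open_door": "door_open",
}


def attempt(action: str, facts: Set[str]) -> Optional[str]:
    """Execute an action; return None on success, else the blocking condition."""
    needed = PRECONDITION[action]
    if needed is not None and needed not in facts:
        return needed
    facts.add(EFFECT[action])
    return None


@dataclass
class BlockingGraph:
    """Minimal stand-in for the state-transition graph: blocked action -> blocking condition."""
    blocked: Dict[str, str] = field(default_factory=dict)

    def update(self, action: str, condition: Optional[str], facts: Set[str]) -> None:
        if condition is not None:
            self.blocked[action] = condition
        # Drop entries whose blocking condition now holds.
        self.blocked = {a: c for a, c in self.blocked.items() if c not in facts}

    def candidates(self, goal_action: str) -> List[str]:
        """Unblocked actions that may resolve an open condition, else the goal action."""
        open_conditions = set(self.blocked.values())
        resolvers = [a for a, e in EFFECT.items()
                     if e in open_conditions and a not in self.blocked]
        return resolvers or [goal_action]

    def prompt(self, goal_action: str) -> str:
        """Text context that a real system would send to the LLM (assumed format)."""
        return (f"Goal: {goal_action}. Blocked: {self.blocked}. "
                f"Options: {self.candidates(goal_action)}")


def first_option(prompt: str, options: List[str]) -> str:
    """Placeholder selector standing in for the zero-shot LLM call."""
    return options[0]


def run(goal_action: str, goal_fact: str,
        select: Callable[[str, List[str]], str] = first_option,
        max_steps: int = 10) -> Tuple[bool, List[str]]:
    """Select and execute one action per step until the goal holds or steps run out."""
    facts: Set[str] = set()
    graph = BlockingGraph()
    history: List[str] = []
    for _ in range(max_steps):
        action = select(graph.prompt(goal_action), graph.candidates(goal_action))
        history.append(action)
        graph.update(action, attempt(action, facts), facts)
        if goal_fact in facts:
            return True, history
    return False, history


if __name__ == "__main__":
    print(run("open_door", "door_open"))
```

In this toy run the agent first tries the goal action, discovers it is blocked, and works backward one blocking condition at a time before retrying. Swapping `first_option` for a function that queries a language model with the generated prompt would be the natural extension, under the assumption that the model returns one of the offered options.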

REACT: Two Datasets for Analyzing Both Human Reactions and Evaluative Feedback to Robots Over Time

Jan 31, 2024

Kate Candon, Nicholas C. Georgiou, Helen Zhou, Sidney Richardson, Qiping Zhang, Brian Scassellati, Marynel Vázquez

Abstract:Recent work in Human-Robot Interaction (HRI) has shown that robots can leverage implicit communicative signals from users to understand how they are being perceived during interactions. For example, these signals can be gaze patterns, facial expressions, or body motions that reflect internal human states. To facilitate future research in this direction, we contribute the REACT database, a collection of two datasets of human-robot interactions that display users' natural reactions to robots during a collaborative game and a photography scenario. Further, we analyze the datasets to show that interaction history is an important factor that can influence human reactions to robots. As a result, we believe that future models for interpreting implicit feedback in HRI should explicitly account for this history. REACT opens up doors to this possibility in the future.
