Stefanos Nikolaidis

Algorithmic Prompt Generation for Diverse Human-like Teaming and Communication with Large Language Models

Apr 04, 2025

Siddharth Srikanth, Varun Bhatt, Boshen Zhang, Werner Hager, Charles Michael Lewis, Katia P. Sycara, Aaquib Tabrez, Stefanos Nikolaidis

Abstract:Understanding how humans collaborate and communicate in teams is essential for improving human-agent teaming and AI-assisted decision-making. However, relying solely on data from large-scale user studies is impractical due to logistical, ethical, and practical constraints, necessitating synthetic models of multiple diverse human behaviors. Recently, agents powered by Large Language Models (LLMs) have been shown to emulate human-like behavior in social settings. Yet obtaining a large set of diverse behaviors requires manual effort in the form of designing prompts. On the other hand, Quality Diversity (QD) optimization has been shown to be capable of generating diverse Reinforcement Learning (RL) agent behavior. In this work, we combine QD optimization with LLM-powered agents to iteratively search for prompts that generate diverse team behavior in a long-horizon, multi-step collaborative environment. We first show, through a human-subjects experiment (n=54 participants), that humans exhibit diverse coordination and communication behavior in this domain. We then show that our approach can effectively replicate trends from human teaming data and also capture behaviors that are not easily observed without collecting large amounts of data. Our findings highlight the combination of QD and LLM-powered agents as an effective tool for studying teaming and communication strategies in multi-agent collaboration.

Soft and Compliant Contact-Rich Hair Manipulation and Care

Jan 05, 2025

Uksang Yoo, Nathaniel Dennler, Eliot Xing, Maja Matarić, Stefanos Nikolaidis, Jeffrey Ichnowski, Jean Oh

Abstract:Hair care robots can help address labor shortages in elderly care while enabling those with limited mobility to maintain their hair-related identity. We present MOE-Hair, a soft robot system that performs three hair-care tasks: head patting, finger combing, and hair grasping. The system features a tendon-driven soft robot end-effector (MOE) with a wrist-mounted RGBD camera, leveraging both mechanical compliance for safety and visual force sensing through deformation. In testing with a force-sensorized mannequin head, MOE achieved comparable hair-grasping effectiveness while applying significantly less force than rigid grippers. Our novel force estimation method combines visual deformation data and tendon tensions from actuators to infer applied forces, reducing sensing errors by up to 60.1% and 20.3% compared to actuator current load-only and depth image-only baselines, respectively. A user study with 12 participants demonstrated statistically significant preferences for MOE-Hair over a baseline system in terms of comfort, effectiveness, and appropriate force application. These results demonstrate the unique advantages of soft robots in contact-rich hair-care tasks, while highlighting the importance of precise force control despite the inherent compliance of the system.

Contrastive Learning from Exploratory Actions: Leveraging Natural Interactions for Preference Elicitation

Jan 02, 2025

Nathaniel Dennler, Stefanos Nikolaidis, Maja Matarić

Abstract:People have a variety of preferences for how robots behave. To understand and reason about these preferences, robots aim to learn a reward function that describes how aligned robot behaviors are with a user's preferences. Good representations of a robot's behavior can significantly reduce the time and effort required for a user to teach the robot their preferences. Specifying these representations -- what "features" of the robot's behavior matter to users -- remains a difficult problem: features learned from raw data lack semantic meaning, and features learned from user data require users to engage in tedious labeling processes. Our key insight is that users tasked with customizing a robot are intrinsically motivated to produce labels through exploratory search; they explore behaviors that they find interesting and ignore behaviors that are irrelevant. To harness this novel data source of exploratory actions, we propose contrastive learning from exploratory actions (CLEA) to learn trajectory features that are aligned with features that users care about. We learned CLEA features from exploratory actions users performed in an open-ended signal design activity (N=25) with a Kuri robot, and evaluated CLEA features through a second user study with a different set of users (N=42). CLEA features outperformed self-supervised features when eliciting user preferences over four metrics: completeness, simplicity, minimality, and explainability.

* Accepted to HRI 2025

Improving User Experience in Preference-Based Optimization of Reward Functions for Assistive Robots

Nov 17, 2024

Nathaniel Dennler, Zhonghao Shi, Stefanos Nikolaidis, Maja Matarić

Abstract:Assistive robots interact with humans and must adapt to different users' preferences to be effective. An easy and effective technique to learn non-expert users' preferences is through rankings of robot behaviors, for example, robot movement trajectories or gestures. Existing techniques focus on generating trajectories for users to rank that maximize the outcome of the preference learning process. However, the generated trajectories do not appear to reflect the user's preference over repeated interactions. In this work, we design an algorithm to generate trajectories for users to rank that we call Covariance Matrix Adaptation Evolution Strategies with Information Gain (CMA-ES-IG). CMA-ES-IG prioritizes the user's experience of the preference learning process. We show that users find our algorithm more intuitive and easier to use than previous approaches across both physical and social robot tasks. This project's code is hosted at github.com/interaction-lab/CMA-ES-IG

* Accepted to ISRR

Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity

Nov 07, 2024

Robby Costales, Stefanos Nikolaidis

Abstract:The wider application of end-to-end learning methods to embodied decision-making domains remains bottlenecked by their reliance on a superabundance of training data representative of the target domain. Meta-reinforcement learning (meta-RL) approaches abandon the aim of zero-shot generalization--the goal of standard reinforcement learning (RL)--in favor of few-shot adaptation, and thus hold promise for bridging larger generalization gaps. While learning this meta-level adaptive behavior still requires substantial data, efficient environment simulators approaching real-world complexity are growing in prevalence. Even so, hand-designing sufficiently diverse and numerous simulated training tasks for these complex domains is prohibitively labor-intensive. Domain randomization (DR) and procedural generation (PG), offered as solutions to this problem, require simulators to possess carefully-defined parameters which directly translate to meaningful task diversity--a similarly prohibitive assumption. In this work, we present DIVA, an evolutionary approach for generating diverse training tasks in such complex, open-ended simulators. Like unsupervised environment design (UED) methods, DIVA can be applied to arbitrary parameterizations, but can additionally incorporate realistically-available domain knowledge--thus inheriting the flexibility and generality of UED, and the supervised structure embedded in well-designed simulators exploited by DR and PG. Our empirical results showcase DIVA's unique ability to overcome complex parameterizations and successfully train adaptive agent behavior, far outperforming competitive baselines from prior literature. These findings highlight the potential of such semi-supervised environment design (SSED) approaches, of which DIVA is the first humble constituent, to enable training in realistic simulated domains, and produce more robust and capable adaptive agents.

* NeurIPS 2024

Algorithmic Scenario Generation as Quality Diversity Optimization

Sep 07, 2024

Stefanos Nikolaidis

Abstract:The increasing complexity of robots and autonomous agents that interact with people highlights the critical need for approaches that systematically test them before deployment. This review paper presents a general framework for solving this problem, describes the insights that we have gained from working on each component of the framework, and shows how integrating these components leads to the discovery of a diverse range of realistic and challenging scenarios that reveal previously unknown failures in deployed robotic systems interacting with people.

GPT-Fabric: Folding and Smoothing Fabric by Leveraging Pre-Trained Foundation Models

Jun 14, 2024

Vedant Raval, Enyu Zhao, Hejia Zhang, Stefanos Nikolaidis, Daniel Seita

Abstract:Fabric manipulation has applications in folding blankets, handling patient clothing, and protecting items with covers. It is challenging for robots to perform fabric manipulation since fabrics have infinite-dimensional configuration spaces, complex dynamics, and may be in folded or crumpled configurations with severe self-occlusions. Prior work on robotic fabric manipulation relies either on heavily engineered setups or learning-based approaches that create and train on robot-fabric interaction data. In this paper, we propose GPT-Fabric for the canonical tasks of fabric folding and smoothing, where GPT directly outputs an action informing a robot where to grasp and pull a fabric. We perform extensive experiments in simulation to test GPT-Fabric against prior state of the art methods for folding and smoothing. We obtain comparable or better performance to most methods even without explicitly training on a fabric-specific dataset (i.e., zero-shot manipulation). Furthermore, we apply GPT-Fabric in physical experiments over 12 folding and 10 smoothing rollouts. Our results suggest that GPT-Fabric is a promising approach for high-precision fabric manipulation tasks.

* Code, prompts, and videos are available at https://tinyurl.com/gptfab

Designing Robot Identity: The Role of Voice, Clothing, and Task on Robot Gender Perception

Mar 30, 2024

Nathaniel S. Dennler, Mina Kian, Stefanos Nikolaidis, Maja Matarić

Abstract:Perceptions of gender are a significant aspect of human-human interaction, and gender has wide-reaching social implications for robots deployed in contexts where they are expected to interact with humans. This work explored two flexible modalities for communicating gender in robots--voice and appearance--and we studied their individual and combined influences on a robot's perceived gender. We evaluated the perception of a robot's gender through three video-based studies. First, we conducted a study (n=65) on the gender perception of robot voices by varying speaker identity and pitch. Second, we conducted a study (n=93) on the gender perception of robot clothing designed for two different tasks. Finally, building on the results of the first two studies, we completed a large integrative video-based study (n=273) involving two human-robot interaction tasks. We found that voice and clothing can be used to reliably establish a robot's perceived gender, and that combining these two modalities can have different effects on the robot's perceived gender. Taken together, these results inform the design of robot voices and clothing as individual and interacting components in the perceptions of robot gender.

Using Causal Trees to Estimate Personalized Task Difficulty in Post-Stroke Individuals

Mar 06, 2024

Nathaniel Dennler, Stefanos Nikolaidis, Maja Matarić

Abstract:Adaptive training programs are crucial for recovery post stroke. However, developing programs that automatically adapt depends on quantifying how difficult a task is for a specific individual at a particular stage of their recovery. In this work, we propose a method that automatically generates regions of different task difficulty levels based on an individual's performance. We show that this technique explains the variance in user performance for a reaching task better than previous approaches to estimating task difficulty.

* Accepted to the 2023 IROS Workshop on Assistive Robots for Citizens

Guidance Graph Optimization for Lifelong Multi-Agent Path Finding

Feb 02, 2024

Yulun Zhang, He Jiang, Varun Bhatt, Stefanos Nikolaidis, Jiaoyang Li

Abstract:We study how to use guidance to improve the throughput of lifelong Multi-Agent Path Finding (MAPF). Previous studies have demonstrated that while incorporating guidance, such as highways, can accelerate MAPF algorithms, this often results in a trade-off with solution quality. In addition, how to generate good guidance automatically remains largely unexplored, with current methods falling short of surpassing manually designed ones. In this work, we introduce the directed guidance graph as a versatile representation of guidance for lifelong MAPF, framing Guidance Graph Optimization (GGO) as the task of optimizing its edge weights. We present two GGO algorithms to automatically generate guidance for arbitrary lifelong MAPF algorithms and maps. The first method directly solves GGO by employing CMA-ES, a black-box optimization algorithm. The second method, PIU, optimizes an update model capable of generating guidance, demonstrating the ability to transfer optimized guidance graphs to larger maps with similar layouts. Empirically, we show that (1) our guidance graphs improve the throughput of three representative lifelong MAPF algorithms in four benchmark maps, and (2) our update model can generate guidance graphs for maps as large as $93 \times 91$ and for as many as 3000 agents.
