Akansel Cosgun

A Service Robot's Guide to Interacting with Busy Customers

Dec 19, 2025

Suraj Nukala, Meera Sushma, Leimin Tian, Akansel Cosgun, Dana Kulic

Abstract: The growing use of service robots in hospitality highlights the need to understand how to communicate effectively with preoccupied customers. This study investigates the efficacy of communication modalities commonly used by service robots, namely acoustic/speech, visual display, and micromotion gestures, in capturing attention and communicating intention to a user in a simulated restaurant scenario. We conducted a two-part user study (N=24) using a Temi robot to simulate delivery tasks, with participants engaged in a typing game (MonkeyType) to emulate a state of busyness. The participants' engagement in the typing game is measured by words per minute (WPM) and typing accuracy. In Part 1, we compared a non-verbal acoustic cue with baseline conditions to assess attention capture during a single-cup delivery task. In Part 2, we evaluated the effectiveness of speech, visual display, micromotion and their multimodal combination in conveying specific intentions (correct cup selection) during a two-cup delivery task. The results indicate that, while speech is highly effective in capturing attention, it is less successful in clearly communicating intention. Participants rated the visual display as the most effective modality for intention clarity, followed by speech, with micromotion ranked lowest. These findings provide insights into optimizing communication strategies for service robots, highlighting the distinct roles of attention capture and intention communication in enhancing user experience in dynamic hospitality settings.

* Proceedings of the 2025 Australasian Conference on Robotics and Automation (ACRA 2025)
* Presented at ACRA 2025. 10 pages, 4 figures. Includes a user study (N=24) using the Temi robot evaluating speech, visual, and micromotion modalities
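
The abstract above uses words per minute (WPM) and typing accuracy as engagement measures but does not say how they are computed. As a rough sketch only, the following assumes the common convention that one "word" is five typed characters and that accuracy is the percentage of correctly typed characters; the function names are hypothetical and this is not necessarily how MonkeyType or the authors compute either value.

```python
def words_per_minute(typed_chars: int, elapsed_seconds: float) -> float:
    """Gross WPM, assuming one word equals five typed characters."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    words = typed_chars / 5.0
    minutes = elapsed_seconds / 60.0
    return words / minutes


def typing_accuracy(correct_chars: int, typed_chars: int) -> float:
    """Percentage of typed characters that were correct (0.0 if nothing typed)."""
    if typed_chars == 0:
        return 0.0
    return 100.0 * correct_chars / typed_chars
```

Comparing these two values in a window before and after the robot arrives is one plausible way to quantify how much an interruption disturbs the primary task.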

Virtual Traffic Lights for Multi-Robot Navigation: Decentralized Planning with Centralized Conflict Resolution

Nov 11, 2025

Sagar Gupta, Thanh Vinh Nguyen, Thieu Long Phan, Vidul Attri, Archit Gupta, Niroshinie Fernando, Kevin Lee, Seng W. Loke, Ronny Kutadinata, Benjamin Champion (+1 more)

Abstract: We present a hybrid multi-robot coordination framework that combines decentralized path planning with centralized conflict resolution. In our approach, each robot autonomously plans its path and shares this information with a centralized node. The centralized system detects potential conflicts and allows only one of the conflicting robots to proceed at a time, instructing others to stop outside the conflicting area to avoid deadlocks. Unlike traditional centralized planning methods, our system does not dictate robot paths but instead provides stop commands, functioning as a virtual traffic light. In simulation experiments with multiple robots, our approach increased the success rate of robots reaching their goals while reducing deadlocks. Furthermore, we successfully validated the system in real-world experiments with two quadruped robots and separately with wheeled Duckiebots.
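
The "virtual traffic light" idea in this abstract can be illustrated with a small sketch. Everything below is an assumption made for illustration: paths are lists of grid cells, a conflict is any cell shared by two paths (timing is ignored), and priority is alphabetical by robot name. The abstract does not specify how conflicts are detected or how the proceeding robot is chosen.

```python
from typing import Dict, List, Optional, Set, Tuple

Cell = Tuple[int, int]


def resolve_conflicts(paths: Dict[str, List[Cell]]) -> Dict[str, str]:
    """Central node: return "GO" or "STOP" per robot without altering any path."""
    commands = {robot: "GO" for robot in paths}
    robots = sorted(paths)  # hypothetical priority rule: alphabetical order
    for i, first in enumerate(robots):
        for second in robots[i + 1:]:
            shared = set(paths[first]) & set(paths[second])
            # Only a robot that is itself moving blocks a lower-priority one.
            if shared and commands[first] == "GO":
                commands[second] = "STOP"
    return commands


def stop_cell(path: List[Cell], conflict: Set[Cell]) -> Optional[Cell]:
    """Last cell on the path before the conflict area, or None if none exists."""
    previous = None
    for cell in path:
        if cell in conflict:
            return previous
        previous = cell
    return previous
```

For example, with robot `a` crossing cell `(1, 0)` that robot `b` also needs, `resolve_conflicts` lets `a` proceed and stops `b`, while `stop_cell` gives the cell where `b` should wait so that it stays outside the shared area.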

Hand Over or Place On The Table? A Study On Robotic Object Delivery When The Recipient Is Occupied

Mar 14, 2025

Thieu Long Phan, Akansel Cosgun

Abstract: This study investigates the subjective experiences of users in two robotic object delivery methods: direct handover and table placement, when users are occupied with another task. A user study involving 15 participants engaged in a typing game revealed that table placement significantly enhances user experience compared to direct handovers, particularly in terms of satisfaction, perceived safety and intuitiveness. Additionally, handovers negatively impacted typing performance, while all participants expressed a clear preference for table placement as the delivery method. These findings highlight the advantages of table placement in scenarios requiring minimal user disruption.

* 3 pages, 2 figures

Collaborative Object Handover in a Robot Crafting Assistant

Feb 27, 2025

Leimin Tian, Shiyu Xu, Kerry He, Rachel Love, Akansel Cosgun, Dana Kulic

Abstract: Robots are increasingly working alongside people, delivering food to patrons in restaurants or helping workers on assembly lines. These scenarios often involve object handovers between the person and the robot. To achieve safe and efficient human-robot collaboration (HRC), it is important to incorporate human context in a robot's handover strategies. Therefore, in this work, we develop a collaborative handover model trained on human teleoperation data collected in a naturalistic crafting task. To evaluate the performance of this model, we conduct cross-validation experiments on the training dataset as well as a user study in the same HRC crafting task. The handover episodes and user perceptions of the autonomous handover policy were compared with those of the human teleoperated handovers. While the cross-validation experiment and user study indicate that the autonomous policy successfully achieved collaborative handovers, the comparison with human teleoperation revealed avenues for further improvements.

Supermarket-6DoF: A Real-World Grasping Dataset and Grasp Pose Representation Analysis

Feb 22, 2025

Jason Toskov, Akansel Cosgun

Abstract: We present Supermarket-6DoF, a real-world dataset of 1500 grasp attempts across 20 supermarket objects with publicly available 3D models. Unlike most existing grasping datasets that rely on analytical metrics or simulation for grasp labeling, our dataset provides ground-truth outcomes from physical robot executions. Among the few real-world grasping datasets, while more modest in size, Supermarket-6DoF uniquely features full 6-DoF grasp poses annotated with both initial grasp success and post-grasp stability under external perturbation. We demonstrate the dataset's utility by analyzing three grasp pose representations for grasp success prediction from point clouds. Our results show that representing the gripper geometry explicitly as a point cloud achieves higher prediction accuracy compared to conventional quaternion-based grasp pose encoding.
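
To make the two kinds of grasp pose representation mentioned in this abstract concrete, here is a minimal sketch. The gripper key points, the `(w, x, y, z)` quaternion order, and all function names are assumptions for illustration; the abstract does not describe the actual gripper model or encodings used in the paper.

```python
from typing import List, Sequence, Tuple

Vec3 = Tuple[float, float, float]

# Hypothetical key points of a parallel-jaw gripper in its own frame (metres).
GRIPPER_POINTS: List[Vec3] = [
    (0.0, 0.0, 0.0),
    (0.04, 0.0, 0.05), (-0.04, 0.0, 0.05),
    (0.04, 0.0, 0.10), (-0.04, 0.0, 0.10),
]


def quat_to_matrix(q: Sequence[float]) -> List[List[float]]:
    """Rotation matrix of a unit quaternion given as (w, x, y, z)."""
    w, x, y, z = q
    return [
        [1 - 2 * (y * y + z * z), 2 * (x * y - w * z), 2 * (x * z + w * y)],
        [2 * (x * y + w * z), 1 - 2 * (x * x + z * z), 2 * (y * z - w * x)],
        [2 * (x * z - w * y), 2 * (y * z + w * x), 1 - 2 * (x * x + y * y)],
    ]


def pose_vector(position: Vec3, quat: Sequence[float]) -> List[float]:
    """Quaternion-based encoding: a flat 7-vector (position, then quaternion)."""
    return list(position) + list(quat)


def gripper_point_cloud(position: Vec3, quat: Sequence[float]) -> List[Vec3]:
    """Explicit encoding: the gripper key points transformed into the scene frame."""
    rot = quat_to_matrix(quat)
    return [
        tuple(sum(rot[i][j] * p[j] for j in range(3)) + position[i] for i in range(3))
        for p in GRIPPER_POINTS
    ]
```

One property this sketch exposes: a quaternion `q` and its negation `-q` describe the same rotation, so they give different 7-vectors but the same point cloud. Whether this ambiguity contributes to the accuracy gap reported above is not stated in the abstract.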

Mixed Reality Outperforms Virtual Reality for Remote Error Resolution in Pick-and-Place Tasks

Feb 10, 2025

Advay Kumar, Stephanie Simangunsong, Pamela Carreno-Medrano, Akansel Cosgun

Abstract: This study evaluates the performance and usability of Mixed Reality (MR), Virtual Reality (VR), and camera stream interfaces for remote error resolution tasks, such as correcting warehouse packaging errors. Specifically, we consider a scenario where a robotic arm halts after detecting an error, requiring a remote operator to intervene and resolve it via pick-and-place actions. Twenty-one participants performed simulated pick-and-place tasks using each interface. A linear mixed model (LMM) analysis of task resolution time, usability scores (SUS), and mental workload scores (NASA-TLX) showed that the MR interface outperformed both VR and camera interfaces. MR enabled significantly faster task completion, was rated higher in usability, and was perceived to be less cognitively demanding. Notably, the MR interface, which projected a virtual robot onto a physical table, provided superior spatial understanding and physical reference cues. Post-study surveys further confirmed participants' preference for MR over other interfaces.

* 9 pages, 5 figures

Hand-Object Contact Detection using Grasp Quality Metrics

Jan 13, 2025

Akansel Cosgun, Thanh Vinh Nguyen

Abstract: We propose a novel hand-object contact detection system based on grasp quality metrics extracted from object and hand poses, and evaluate its performance using the DexYCB dataset. Our evaluation demonstrates the system's high accuracy (approaching 90%). Future work will focus on a real-time implementation using vision-based estimation and on integrating it into a robot-to-human handover system.

* Submitted to the 2025 IEEE/ACM International Conference on Human-Robot Interaction (HRI'25)

"One Soy Latte for Daniel": Visual and Movement Communication of Intention from a Robot Waiter to a Group of Customers

Jul 08, 2024

Seung Chan Hong, Leimin Tian, Akansel Cosgun, Dana Kulić

Abstract: Service robots are increasingly employed in the hospitality industry for delivering food orders in restaurants. However, in current practice the robot often arrives at a fixed location for each table when delivering orders to different patrons in the same dining group, thus requiring a human staff member or the customers themselves to identify and retrieve each order. This study investigates how to improve the robot's service behaviours to facilitate clear intention communication to a group of users, thus achieving accurate delivery and positive user experiences. Specifically, we conduct user studies (N=30) with a Temi service robot as a representative delivery robot currently adopted in restaurants. We investigated two factors in the robot's intent communication, namely visualisation and movement trajectories, and their influence on the objective and subjective interaction outcomes. A robot personalising its movement trajectory and stopping location in addition to displaying a visualisation of the order yields more accurate intent communication and successful order delivery, as well as more positive user perception towards the robot and its service. Our results also showed that individuals in a group have different interaction experiences.

A Review of Differentiable Simulators

Jul 08, 2024

Rhys Newbury, Jack Collins, Kerry He, Jiahe Pan, Ingmar Posner, David Howard, Akansel Cosgun

Abstract: Differentiable simulators continue to push the state of the art across a range of domains including computational physics, robotics, and machine learning. Their main value is the ability to compute gradients of physical processes, which allows differentiable simulators to be readily integrated into commonly employed gradient-based optimization schemes. To achieve this, a number of design decisions need to be considered representing trade-offs in versatility, computational speed, and accuracy of the gradients obtained. This paper presents an in-depth review of the evolving landscape of differentiable physics simulators. We introduce the foundations and core components of differentiable simulators alongside common design choices. This is followed by a practical guide and overview of open-source differentiable simulators that have been used across past research. Finally, we review and contextualize prominent applications of differentiable simulation. By offering a comprehensive review of the current state-of-the-art in differentiable simulation, this work aims to serve as a resource for researchers and practitioners looking to understand and integrate differentiable physics within their research. We conclude by highlighting current limitations as well as providing insights into future directions for the field.

* Accepted to IEEE Access
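
As a toy illustration of what "computing gradients of physical processes" means, the sketch below simulates a point mass thrown upward with linear drag using explicit Euler steps, and carries the derivative of the state with respect to the launch speed alongside the state itself (a hand-written forward-mode sensitivity). The physical setup, parameter values, and function names are all assumptions chosen for brevity; real differentiable simulators typically obtain these gradients through automatic differentiation rather than hand-derived update rules.

```python
def simulate(v0: float, steps: int = 100, dt: float = 0.01,
             g: float = 9.81, drag: float = 0.5):
    """Return (final height, d(final height)/d(v0)) for launch speed v0."""
    x, v = 0.0, v0
    dx, dv = 0.0, 1.0  # derivatives of x and v with respect to v0
    for _ in range(steps):
        # Position update uses the velocity from before this step.
        x, dx = x + dt * v, dx + dt * dv
        # Velocity update: gravity plus linear drag, differentiated term by term.
        v, dv = v + dt * (-g - drag * v), dv * (1.0 - dt * drag)
    return x, dx


def fit_launch_speed(target: float, v0: float = 0.0,
                     lr: float = 0.5, iters: int = 200) -> float:
    """Gradient descent on (final height - target)^2 using the simulator gradient."""
    for _ in range(iters):
        x, dx = simulate(v0)
        v0 -= lr * 2.0 * (x - target) * dx
    return v0
```

Calling `fit_launch_speed(2.0)` searches for the launch speed that brings the mass to a height of 2 m after one simulated second, which is the kind of gradient-based optimization loop the abstract refers to.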

Audio-Visual Traffic Light State Detection for Urban Robots

Apr 30, 2024

Sagar Gupta, Akansel Cosgun

Abstract: We present a multimodal traffic light state detection method using vision and sound, from the viewpoint of a quadruped robot navigating in urban settings. This is a challenging problem because of the visual occlusions and noise from robot locomotion. Our method combines features from raw audio with the ratios of red and green pixels within bounding boxes, identified by established vision-based detectors. The fusion method aggregates features across multiple frames in a given timeframe, increasing robustness and adaptability. Results show that our approach effectively addresses the challenge of visual occlusion and surpasses the performance of single-modality solutions when the robot is in motion. This study serves as a proof of concept, highlighting the significant, yet often overlooked, potential of multi-modal perception in robotics.

* Submitted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2024
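
The visual feature described in this abstract, the ratio of red and green pixels inside a detected bounding box aggregated over several frames, might be sketched as follows. The colour thresholds, the use of a plain mean for aggregation, the treatment of the audio features as an opaque list, and the function names are all assumptions; the abstract does not give the actual thresholds, audio features, or fusion model.

```python
from typing import List, Sequence, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each 0..255


def color_ratios(pixels: Sequence[Pixel]) -> Tuple[float, float]:
    """Fractions of red-ish and green-ish pixels in one bounding-box crop."""
    if not pixels:
        return 0.0, 0.0  # e.g. the light is occluded and no box was detected
    # Hypothetical thresholds; a real system would tune these or work in HSV.
    red = sum(1 for r, g, b in pixels if r > 150 and g < 100 and b < 100)
    green = sum(1 for r, g, b in pixels if g > 150 and r < 100)
    return red / len(pixels), green / len(pixels)


def fused_features(frames: Sequence[Sequence[Pixel]],
                   audio_features: Sequence[float]) -> List[float]:
    """Average the colour ratios over a window of frames, then append audio features."""
    if not frames:
        return [0.0, 0.0] + list(audio_features)
    ratios = [color_ratios(frame) for frame in frames]
    mean_red = sum(r for r, _ in ratios) / len(ratios)
    mean_green = sum(g for _, g in ratios) / len(ratios)
    return [mean_red, mean_green] + list(audio_features)
```

The resulting vector would then be passed to whatever classifier predicts the light state. Averaging over a window means a few occluded frames dilute the visual evidence rather than erase it, while the audio features remain available throughout.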
