Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mahdi Khoramshahi

ISIR

Proceedings of 1st Workshop on Advancing Artificial Intelligence through Theory of Mind

Apr 28, 2025

Mouad Abrini, Omri Abend, Dina Acklin, Henny Admoni, Gregor Aichinger, Nitay Alon, Zahra Ashktorab, Ashish Atreja, Moises Auron, Alexander Aufreiter(+98 more)

Abstract:This volume includes a selection of papers presented at the Workshop on Advancing Artificial Intelligence through Theory of Mind held at AAAI 2025 in Philadelphia US on 3rd March 2025. The purpose of this volume is to provide an open access and curated anthology for the ToM and AI research community.

* workshop proceedings

Via

Access Paper or Ask Questions

Reasoning LLMs for User-Aware Multimodal Conversational Agents

Apr 02, 2025

Hamed Rahimi, Jeanne Cattoni, Meriem Beghili, Mouad Abrini, Mahdi Khoramshahi, Maribel Pino, Mohamed Chetouani

Abstract:Personalization in social robotics is critical for fostering effective human-robot interactions, yet systems often face the cold start problem, where initial user preferences or characteristics are unavailable. This paper proposes a novel framework called USER-LLM R1 for a user-aware conversational agent that addresses this challenge through dynamic user profiling and model initiation. Our approach integrates chain-of-thought (CoT) reasoning models to iteratively infer user preferences and vision-language models (VLMs) to initialize user profiles from multimodal inputs, enabling personalized interactions from the first encounter. Leveraging a Retrieval-Augmented Generation (RAG) architecture, the system dynamically refines user representations within an inherent CoT process, ensuring contextually relevant and adaptive responses. Evaluations on the ElderlyTech-VQA Bench demonstrate significant improvements in ROUGE-1 (+23.2%), ROUGE-2 (+0.6%), and ROUGE-L (+8%) F1 scores over state-of-the-art baselines, with ablation studies underscoring the impact of reasoning model size on performance. Human evaluations further validate the framework's efficacy, particularly for elderly users, where tailored responses enhance engagement and trust. Ethical considerations, including privacy preservation and bias mitigation, are rigorously discussed and addressed to ensure responsible deployment.

Via

Access Paper or Ask Questions

Task-Aware Robotic Grasping by evaluating Quality Diversity Solutions through Foundation Models

Nov 22, 2024

Aurel X. Appius, Emiland Garrabe, Francois Helenon, Mahdi Khoramshahi, Stephane Doncieux

Abstract:Task-aware robotic grasping is a challenging problem that requires the integration of semantic understanding and geometric reasoning. Traditional grasp planning approaches focus on stable or feasible grasps, often disregarding the specific tasks the robot needs to accomplish. This paper proposes a novel framework that leverages Large Language Models (LLMs) and Quality Diversity (QD) algorithms to enable zero-shot task-conditioned grasp selection. The framework segments objects into meaningful subparts and labels each subpart semantically, creating structured representations that can be used to prompt an LLM. By coupling semantic and geometric representations of an object's structure, the LLM's knowledge about tasks and which parts to grasp can be applied in the physical world. The QD-generated grasp archive provides a diverse set of grasps, allowing us to select the most suitable grasp based on the task. We evaluate the proposed method on a subset of the YCB dataset, where a Franka Emika robot is assigned to perform various actions based on object-specific task requirements. We created a ground truth by conducting a survey with six participants to determine the best grasp region for each task-object combination according to human intuition. The model was evaluated on 12 different objects across 4--7 object-specific tasks, achieving a weighted intersection over union (IoU) of 76.4% when compared to the survey data.

* 8 pages, 5 figures

Via

Access Paper or Ask Questions

Interaction force estimation for tactile sensor arrays: Toward tactile-based interaction control for robotic fingers

Nov 20, 2024

Elie Chelly, Andrea Cherubini, Philippe Fraisse, Faiz Ben Amar, Mahdi Khoramshahi

Abstract:Accurate estimation of interaction forces is crucial for achieving fine, dexterous control in robotic systems. Although tactile sensor arrays offer rich sensing capabilities, their effective use has been limited by challenges such as calibration complexities, nonlinearities, and deformation. In this paper, we tackle these issues by presenting a novel method for obtaining 3D force estimation using tactile sensor arrays. Unlike existing approaches that focus on specific or decoupled force components, our method estimates full 3D interaction forces across an array of distributed sensors, providing comprehensive real-time feedback. Through systematic data collection and model training, our approach overcomes the limitations of prior methods, achieving accurate and reliable tactile-based force estimation. Besides, we integrate this estimation in a real-time control loop, enabling implicit, stable force regulation that is critical for precise robotic manipulation. Experimental validation on the Allegro robot hand with uSkin sensors demonstrates the effectiveness of our approach in real-time control, and its ability to enhance the robot's adaptability and dexterity.

* 8 pages, 5 figures, submitted to ICRA 2025

Via

Access Paper or Ask Questions

Enhancing Robustness in Language-Driven Robotics: A Modular Approach to Failure Reduction

Nov 08, 2024

Émiland Garrabé, Pierre Teixeira, Mahdi Khoramshahi, Stéphane Doncieux

Figure 1 for Enhancing Robustness in Language-Driven Robotics: A Modular Approach to Failure Reduction

Figure 2 for Enhancing Robustness in Language-Driven Robotics: A Modular Approach to Failure Reduction

Figure 3 for Enhancing Robustness in Language-Driven Robotics: A Modular Approach to Failure Reduction

Figure 4 for Enhancing Robustness in Language-Driven Robotics: A Modular Approach to Failure Reduction

Abstract:Recent advances in large language models (LLMs) have led to significant progress in robotics, enabling embodied agents to better understand and execute open-ended tasks. However, existing approaches using LLMs face limitations in grounding their outputs within the physical environment and aligning with the capabilities of the robot. This challenge becomes even more pronounced with smaller language models, which are more computationally efficient but less robust in task planning and execution. In this paper, we present a novel modular architecture designed to enhance the robustness of LLM-driven robotics by addressing these grounding and alignment issues. We formalize the task planning problem within a goal-conditioned POMDP framework, identify key failure modes in LLM-driven planning, and propose targeted design principles to mitigate these issues. Our architecture introduces an ``expected outcomes'' module to prevent mischaracterization of subgoals and a feedback mechanism to enable real-time error recovery. Experimental results, both in simulation and on physical robots, demonstrate that our approach significantly improves task success rates for pick-and-place and manipulation tasks compared to both larger LLMs and standard baselines. Through hardware experiments, we also demonstrate how our architecture can be run efficiently and locally. This work highlights the potential of smaller, locally-executable LLMs in robotics and provides a scalable, efficient solution for robust task execution.

* Submitted to ICRA 2025

Via

Access Paper or Ask Questions

Fingertip Contact Force Direction Control using Tactile Feedback

Jun 17, 2024

Dounia Kitouni, Elie Chelly, Mahdi Khoramshahi, Veronique Perdereau

Figure 1 for Fingertip Contact Force Direction Control using Tactile Feedback

Figure 2 for Fingertip Contact Force Direction Control using Tactile Feedback

Figure 3 for Fingertip Contact Force Direction Control using Tactile Feedback

Figure 4 for Fingertip Contact Force Direction Control using Tactile Feedback

Abstract:The human hand is an immensely sophisticated tool adept at manipulating and grasping objects of unknown characteristics. Its capability lies in perceiving interaction dynamics through touch and adjusting contact force direction and magnitude to ensure successful manipulation. Despite advancements in control algorithms, sensing technologies, compliance integration, and ongoing research, precise finger force control for dexterous manipulation using tactile sensing remains relatively unexplored.In this work, we explore the challenges related to individual finger contact force control and propose a method for directing such forces perceived through tactile sensing. The proposed method is evaluated using an Allegro hand with Xela tactile sensors. Results are presented and discussed, alongside consideration for potential future improvements.

* IEEE 20th International Conference on Automation Science and Engineering, Aug 2024, Bari (IT), France

Via

Access Paper or Ask Questions

Robotic in-hand manipulation with relaxed optimization

Jun 07, 2024

Ali Hammoud, Valerio Belcamino, Quentin Huet, Alessandro Carfì, Mahdi Khoramshahi, Veronique Perdereau, Fulvio Mastrogiovanni

Figure 1 for Robotic in-hand manipulation with relaxed optimization

Figure 2 for Robotic in-hand manipulation with relaxed optimization

Figure 3 for Robotic in-hand manipulation with relaxed optimization

Figure 4 for Robotic in-hand manipulation with relaxed optimization

Abstract:Dexterous in-hand manipulation is a unique and valuable human skill requiring sophisticated sensorimotor interaction with the environment while respecting stability constraints. Satisfying these constraints with generated motions is essential for a robotic platform to achieve reliable in-hand manipulation skills. Explicitly modelling these constraints can be challenging, but they can be implicitly modelled and learned through experience or human demonstrations. We propose a learning and control approach based on dictionaries of motion primitives generated from human demonstrations. To achieve this, we defined an optimization process that combines motion primitives to generate robot fingertip trajectories for moving an object from an initial to a desired final pose. Based on our experiments, our approach allows a robotic hand to handle objects like humans, adhering to stability constraints without requiring explicit formalization. In other words, the proposed motion primitive dictionaries learn and implicitly embed the constraints crucial to the in-hand manipulation task.

* 9 pages, 6 pictures, ROMAN 2024

Via

Access Paper or Ask Questions

Speeding up 6-DoF Grasp Sampling with Quality-Diversity

Mar 10, 2024

Johann Huber, François Hélénon, Mathilde Kappel, Elie Chelly, Mahdi Khoramshahi, Faïz Ben Amar, Stéphane Doncieux

Figure 1 for Speeding up 6-DoF Grasp Sampling with Quality-Diversity

Figure 2 for Speeding up 6-DoF Grasp Sampling with Quality-Diversity

Figure 3 for Speeding up 6-DoF Grasp Sampling with Quality-Diversity

Figure 4 for Speeding up 6-DoF Grasp Sampling with Quality-Diversity

Abstract:Recent advances in AI have led to significant results in robotic learning, including natural language-conditioned planning and efficient optimization of controllers using generative models. However, the interaction data remains the bottleneck for generalization. Getting data for grasping is a critical challenge, as this skill is required to complete many manipulation tasks. Quality-Diversity (QD) algorithms optimize a set of solutions to get diverse, high-performing solutions to a given problem. This paper investigates how QD can be combined with priors to speed up the generation of diverse grasps poses in simulation compared to standard 6-DoF grasp sampling schemes. Experiments conducted on 4 grippers with 2-to-5 fingers on standard objects show that QD outperforms commonly used methods by a large margin. Further experiments show that QD optimization automatically finds some efficient priors that are usually hard coded. The deployment of generated grasps on a 2-finger gripper and an Allegro hand shows that the diversity produced maintains sim-to-real transferability. We believe these results to be a significant step toward the generation of large datasets that can lead to robust and generalizing robotic grasping policies.

* 7 pages, 8 figures. Preprint version

Via

Access Paper or Ask Questions

A model-free approach to fingertip slip and disturbance detection for grasp stability inference

Nov 22, 2023

Dounia Kitouni, Mahdi Khoramshahi, Veronique Perdereau

Figure 1 for A model-free approach to fingertip slip and disturbance detection for grasp stability inference

Figure 2 for A model-free approach to fingertip slip and disturbance detection for grasp stability inference

Figure 3 for A model-free approach to fingertip slip and disturbance detection for grasp stability inference

Figure 4 for A model-free approach to fingertip slip and disturbance detection for grasp stability inference

Abstract:Robotic capacities in object manipulation are incomparable to those of humans. Besides years of learning, humans rely heavily on the richness of information from physical interaction with the environment. In particular, tactile sensing is crucial in providing such rich feedback. Despite its potential contributions to robotic manipulation, tactile sensing is less exploited; mainly due to the complexity of the time series provided by tactile sensors. In this work, we propose a method for assessing grasp stability using tactile sensing. More specifically, we propose a methodology to extract task-relevant features and design efficient classifiers to detect object slippage with respect to individual fingertips. We compare two classification models: support vector machine and logistic regression. We use highly sensitive Uskin tactile sensors mounted on an Allegro hand to test and validate our method. Our results demonstrate that the proposed method is effective in slippage detection in an online fashion.

* IEEE International Conference on Development and Learning 2023 (ICDL), Nov 2023, Macau, China

Via

Access Paper or Ask Questions