Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Rikunari Sagara

Nagoya Institute of Technology

BeliefNest: A Joint Action Simulator for Embodied Agents with Theory of Mind

May 18, 2025

Rikunari Sagara, Koichiro Terao, Naoto Iwahashi

Abstract:This paper introduces an open-source simulator, BeliefNest, designed to enable embodied agents to perform collaborative tasks by leveraging Theory of Mind. BeliefNest dynamically and hierarchically constructs simulators within a Minecraft environment, allowing agents to explicitly represent nested belief states about themselves and others. This enables agent control in open-domain tasks that require Theory of Mind reasoning. The simulator provides a prompt generation mechanism based on each belief state, facilitating the design and evaluation of methods for agent control utilizing large language models (LLMs). We demonstrate through experiments that agents can infer others' beliefs and predict their belief-based actions in false-belief tasks.

Via

Access Paper or Ask Questions

Unsupervised Lexical Acquisition of Relative Spatial Concepts Using Spoken User Utterances

Jun 16, 2021

Rikunari Sagara, Ryo Taguchi, Akira Taniguchi, Tadahiro Taniguchi, Koosuke Hattori, Masahiro Hoguro, Taizo Umezaki

Figure 1 for Unsupervised Lexical Acquisition of Relative Spatial Concepts Using Spoken User Utterances

Figure 2 for Unsupervised Lexical Acquisition of Relative Spatial Concepts Using Spoken User Utterances

Figure 3 for Unsupervised Lexical Acquisition of Relative Spatial Concepts Using Spoken User Utterances

Figure 4 for Unsupervised Lexical Acquisition of Relative Spatial Concepts Using Spoken User Utterances

Abstract:This paper proposes methods for unsupervised lexical acquisition for relative spatial concepts using spoken user utterances. A robot with a flexible spoken dialog system must be able to acquire linguistic representation and its meaning specific to an environment through interactions with humans as children do. Specifically, relative spatial concepts (e.g., front and right) are widely used in our daily lives, however, it is not obvious which object is a reference object when a robot learns relative spatial concepts. Therefore, we propose methods by which a robot without prior knowledge of words can learn relative spatial concepts. The methods are formulated using a probabilistic model to estimate the proper reference objects and distributions representing concepts simultaneously. The experimental results show that relative spatial concepts and a phoneme sequence representing each concept can be learned under the condition that the robot does not know which located object is the reference object. Additionally, we show that two processes in the proposed method improve the estimation accuracy of the concepts: generating candidate word sequences by class n-gram and selecting word sequences using location information. Furthermore, we show that clues to reference objects improve accuracy even though the number of candidate reference objects increases.

* 27 pages, 12 figures, submitted to Advanced Robotics

Via

Access Paper or Ask Questions