Weiyang Li

Advancing Embodied Agent Security: From Safety Benchmarks to Input Moderation

Apr 22, 2025

Ning Wang, Zihan Yan, Weiyang Li, Chuan Ma, He Chen, Tao Xiang

Abstract: Embodied agents exhibit immense potential across a multitude of domains, making the assurance of their behavioral safety a fundamental prerequisite for their widespread deployment. However, existing research predominantly concentrates on the security of general large language models, lacking specialized methodologies for establishing safety benchmarks and input moderation tailored to embodied agents. To bridge this gap, this paper introduces a novel input moderation framework, meticulously designed to safeguard embodied agents. This framework encompasses the entire pipeline, including taxonomy definition, dataset curation, moderator architecture, model training, and rigorous evaluation. Notably, we introduce EAsafetyBench, a meticulously crafted safety benchmark engineered to facilitate both the training and stringent assessment of moderators specifically designed for embodied agents. Furthermore, we propose Pinpoint, an innovative prompt-decoupled input moderation scheme that harnesses a masked attention mechanism to effectively isolate and mitigate the influence of functional prompts on moderation tasks. Extensive experiments conducted on diverse benchmark datasets and models validate the feasibility and efficacy of the proposed approach. The results demonstrate that our methodologies achieve an impressive average detection accuracy of 94.58%, surpassing the performance of existing state-of-the-art techniques, alongside an exceptional moderation processing time of merely 0.002 seconds per instance.

* 9 pages
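
The abstract describes Pinpoint only at a high level, so the following is a generic illustration of masked attention, not the authors' architecture. It is a minimal sketch that assumes the functional-prompt tokens can be identified by position and that masking means assigning them zero attention weight. The function name `masked_attention` and the `keep` flags are hypothetical.

```python
import math

def masked_attention(query, keys, values, keep):
    """Attend from one query vector over keys, skipping positions where keep is False.

    Assumption: `keep` marks the tokens that should influence the result
    (for example, the user input) and excludes functional-prompt tokens.
    At least one position must be kept.
    """
    scale = math.sqrt(len(query))
    # Masked positions get a score of -inf, which becomes weight 0 after softmax.
    scores = [
        sum(q * k for q, k in zip(query, key)) / scale if flag else -math.inf
        for key, flag in zip(keys, keep)
    ]
    top = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(values[0])
    output = [sum(w * v[j] for w, v in zip(weights, values)) for j in range(dim)]
    return output, weights

# Hypothetical 4-token sequence: the first two tokens stand in for a functional prompt.
query = [1.0, 0.0]
keys = [[5.0, 5.0], [3.0, -2.0], [1.0, 0.0], [0.0, 1.0]]
values = [[9.0, 9.0], [7.0, 7.0], [1.0, 0.0], [0.0, 1.0]]
keep = [False, False, True, True]

output, weights = masked_attention(query, keys, values, keep)
```

In this sketch the masked tokens receive exactly zero weight, so changing their keys or values leaves the output unchanged, which is the sense in which the moderation signal is decoupled from the prompt.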

3PSDF: Three-Pole Signed Distance Function for Learning Surfaces with Arbitrary Topologies

May 31, 2022

Weikai Chen, Cheng Lin, Weiyang Li, Bo Yang

Abstract: Recent advances in learning 3D shapes using neural implicit functions have achieved impressive results by breaking the previous barrier of resolution and diversity for varying topologies. However, most such approaches are limited to closed surfaces, as they require the space to be divided into inside and outside. More recent works based on unsigned distance functions have been proposed to handle complex geometry containing both open and closed surfaces. Nonetheless, as their direct outputs are point clouds, robustly obtaining high-quality meshing results from discrete points remains an open question. We present a novel learnable implicit representation, called the three-pole signed distance function (3PSDF), that can represent non-watertight 3D shapes with arbitrary topologies while supporting easy field-to-mesh conversion using the classic Marching Cubes algorithm. The key to our method is the introduction of a new sign, the NULL sign, in addition to the conventional in and out labels. The existence of the null sign can stop the formation of a closed isosurface derived from the bisector of the in/out regions. Further, we propose a dedicated learning framework to effectively learn 3PSDF without worrying about the vanishing gradient due to the null labels. Experimental results show that our approach outperforms the previous state-of-the-art methods in a wide range of benchmarks both quantitatively and qualitatively.

* Accepted to CVPR 2022
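
The role of the null sign can be illustrated with a one-dimensional toy version of the sign-change test that Marching Cubes applies along grid edges. This is a minimal sketch, not the paper's implementation: it assumes the null sign is encoded as NaN, and the names `NULL` and `crossings` are hypothetical.

```python
import math

NULL = float("nan")  # assumed encoding of the null sign

def crossings(samples):
    """Return each index i where the surface crosses between samples[i] and samples[i + 1].

    A crossing needs one endpoint inside (negative) and one outside (positive).
    An edge with a null endpoint never produces a crossing.
    """
    hits = []
    for i, (a, b) in enumerate(zip(samples, samples[1:])):
        if math.isnan(a) or math.isnan(b):
            continue  # the null sign suppresses the surface on this edge
        if a * b < 0:
            hits.append(i)
    return hits

# Conventional signed field: the in/out labels force the surface to close.
closed_row = [0.5, -0.5, -0.5, 0.5]
# Three-pole field: a null sample removes the second crossing, leaving an open surface.
open_row = [0.5, -0.5, NULL, 0.5]

print(crossings(closed_row))  # [0, 2]
print(crossings(open_row))    # [0]
```

With only in and out labels, every transition between regions yields a surface point, so the second row would close up as well. Marking a sample as null drops the crossings on its adjacent edges, which is how an open boundary can survive field-to-mesh conversion.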
