Abstract: With the continuous impact of epidemics, people have become accustomed to wearing masks. However, most current occluded face recognition (OFR) algorithms lack prior knowledge of occlusions, resulting in poor performance on occluded faces of varying types and severity in reality. Recognizing occluded faces therefore remains a significant challenge that greatly affects the convenience of people's daily lives. In this paper, we propose an identity-gated mixture of diffusion experts (MoDE) for OFR. Each diffusion-based generative expert estimates one possible complete image for an occluded face. Because of the random sampling process of the diffusion model, the inpainted faces inevitably differ from the real ones. To ensemble effective information from the multiple reconstructed faces, we introduce an identity-gating network that evaluates the contribution of each reconstructed face to the identity and adaptively integrates the predictions in the decision space. Moreover, our MoDE is a plug-and-play module for most existing face recognition models. Extensive experiments on three public face datasets and two datasets in the wild demonstrate the superior performance of our method under various occlusions compared with competing methods.
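To make the identity-gated ensemble concrete, the following is a minimal sketch of how multiple diffusion-inpainted faces could be fused by a gating network in the feature space. It is illustrative only: `inpaint_experts`, `face_encoder`, the MLP gate, and the softmax fusion are assumptions, not the paper's actual implementation.

```python
import torch
import torch.nn as nn

class IdentityGatedMoDE(nn.Module):
    """Sketch: fuse recognition features from K diffusion-inpainted faces.

    `face_encoder` stands in for any pretrained face recognition backbone,
    and `inpaint_experts` for a list of K diffusion-based inpainters
    (both are hypothetical interfaces, not the paper's code).
    """
    def __init__(self, face_encoder, inpaint_experts, feat_dim=512):
        super().__init__()
        self.encoder = face_encoder
        self.experts = inpaint_experts
        # identity-gating network (assumed to be a small MLP scoring each face)
        self.gate = nn.Sequential(nn.Linear(feat_dim, 128), nn.ReLU(), nn.Linear(128, 1))

    def forward(self, occluded_face):
        # each expert samples one plausible completion of the occluded face
        completions = [expert(occluded_face) for expert in self.experts]
        feats = torch.stack([self.encoder(x) for x in completions], dim=1)  # (B, K, D)
        scores = self.gate(feats).squeeze(-1)                               # (B, K)
        weights = torch.softmax(scores, dim=1)        # adaptive contribution of each face
        return (weights.unsqueeze(-1) * feats).sum(dim=1)  # fused identity embedding
```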
Abstract: Model-based reinforcement learning (RL) has shown high training efficiency and the capability to handle high-dimensional tasks. Regarding safety, safe model-based RL can achieve nearly zero-cost performance and effectively manage the trade-off between performance and safety. Nevertheless, prior works still face safety challenges because of online exploration during real-world deployment. To address this, some offline RL methods have emerged as solutions that learn safely from a static dataset by avoiding interactions with the environment. In this paper, we aim to further enhance safety during the deployment stage of vision-based robotic tasks by fine-tuning an offline-trained policy. We incorporate in-sample optimization, model-based policy expansion, and reachability guidance to construct a safe offline-to-online framework. Moreover, our method improves the generalization of the offline policy to unseen safety-constrained scenarios. Finally, the efficiency of our method is validated on simulation benchmarks with five vision-only tasks and on a real robot, where it solves deployment problems using limited data.
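As a rough illustration of how policy expansion and reachability guidance could interact at action-selection time, the sketch below filters candidate actions from an offline and an online policy through a reachability check. All interfaces (`offline_policy`, `online_policy`, `q_value`, `reach_value`, the threshold) are assumptions for exposition, not the authors' framework.

```python
def select_action(obs, offline_policy, online_policy, q_value, reach_value, threshold=0.0):
    """Sketch of policy expansion with reachability guidance (hypothetical interfaces).

    Both policies map an observation to a candidate action, `q_value(obs, a)`
    estimates the return, and `reach_value(obs, a)` estimates a reachability
    score where values <= threshold are treated as safe.
    """
    candidates = [offline_policy(obs), online_policy(obs)]
    # keep only candidates that the reachability critic judges safe
    safe = [a for a in candidates if reach_value(obs, a) <= threshold]
    if safe:
        # among safe candidates, prefer the one with the higher estimated return
        return max(safe, key=lambda a: q_value(obs, a))
    # if neither candidate is judged safe, fall back to the least-unsafe one
    return min(candidates, key=lambda a: reach_value(obs, a))
```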
Abstract: Offline goal-conditioned reinforcement learning (GCRL) aims to solve goal-reaching tasks with sparse rewards from an offline dataset. While prior work has demonstrated various approaches for agents to learn near-optimal policies, these methods encounter limitations when dealing with diverse constraints in complex environments, such as safety constraints. Some of these approaches prioritize goal attainment without considering safety, while others overemphasize safety at the expense of training efficiency. In this paper, we study the problem of constrained offline GCRL and propose a new method called Recovery-based Supervised Learning (RbSL) to accomplish safety-critical tasks with various goals. To evaluate its performance, we build a benchmark based on the robot-fetching environment with a randomly positioned obstacle and use expert or random policies to generate an offline dataset. We compare RbSL with three offline GCRL algorithms and one offline safe RL algorithm, and our method outperforms existing state-of-the-art methods by a large margin. Furthermore, we validate the practicality and effectiveness of RbSL by deploying it on a real Panda manipulator. Code is available at https://github.com/Sunlighted/RbSL.git.
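To give a flavor of a recovery-style action selector for constrained goal-conditioned control, here is a minimal sketch that switches from the goal-reaching policy to a recovery policy when a cost critic predicts a constraint violation. The components `task_policy`, `recovery_policy`, `cost_critic`, and `cost_limit` are hypothetical placeholders, not the released RbSL code.

```python
import numpy as np

def rbsl_style_act(obs, goal, task_policy, recovery_policy, cost_critic, cost_limit=0.1):
    """Sketch of a recovery-based action selector (assumed interfaces).

    `task_policy(obs, goal)` is a goal-conditioned policy trained by supervised
    learning on the offline data, `recovery_policy(obs)` steers away from the
    obstacle, and `cost_critic(obs, action)` predicts the expected constraint cost.
    """
    action = task_policy(obs, goal)
    if cost_critic(obs, action) > cost_limit:
        # predicted constraint violation: hand control to the recovery policy
        action = recovery_policy(obs)
    return np.clip(action, -1.0, 1.0)  # keep the command within the actuator limits
```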