Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jihoon Suh

Relative Entropy Regularized Reinforcement Learning for Efficient Encrypted Policy Synthesis

Jun 14, 2025

Jihoon Suh, Yeongjun Jang, Kaoru Teranishi, Takashi Tanaka

Figure 1 for Relative Entropy Regularized Reinforcement Learning for Efficient Encrypted Policy Synthesis

Figure 2 for Relative Entropy Regularized Reinforcement Learning for Efficient Encrypted Policy Synthesis

Figure 3 for Relative Entropy Regularized Reinforcement Learning for Efficient Encrypted Policy Synthesis

Abstract:We propose an efficient encrypted policy synthesis to develop privacy-preserving model-based reinforcement learning. We first demonstrate that the relative-entropy-regularized reinforcement learning framework offers a computationally convenient linear and ``min-free'' structure for value iteration, enabling a direct and efficient integration of fully homomorphic encryption with bootstrapping into policy synthesis. Convergence and error bounds are analyzed as encrypted policy synthesis propagates errors under the presence of encryption-induced errors including quantization and bootstrapping. Theoretical analysis is validated by numerical simulations. Results demonstrate the effectiveness of the RERL framework in integrating FHE for encrypted policy synthesis.

* IEEE Control Systems Letters, pp. 1-1, June 2025
* 6 pages, 2 figures, Published in IEEE Control Systems Letters, June 2025

Via

Access Paper or Ask Questions

Efficient Implementation of Reinforcement Learning over Homomorphic Encryption

Apr 12, 2025

Jihoon Suh, Takashi Tanaka

Abstract:We investigate encrypted control policy synthesis over the cloud. While encrypted control implementations have been studied previously, we focus on the less explored paradigm of privacy-preserving control synthesis, which can involve heavier computations ideal for cloud outsourcing. We classify control policy synthesis into model-based, simulator-driven, and data-driven approaches and examine their implementation over fully homomorphic encryption (FHE) for privacy enhancements. A key challenge arises from comparison operations (min or max) in standard reinforcement learning algorithms, which are difficult to execute over encrypted data. This observation motivates our focus on Relative-Entropy-regularized reinforcement learning (RL) problems, which simplifies encrypted evaluation of synthesis algorithms due to their comparison-free structures. We demonstrate how linearly solvable value iteration, path integral control, and Z-learning can be readily implemented over FHE. We conduct a case study of our approach through numerical simulations of encrypted Z-learning in a grid world environment using the CKKS encryption scheme, showing convergence with acceptable approximation error. Our work suggests the potential for secure and efficient cloud-based reinforcement learning.

* Journal of The Society of Instrument and Control Engineers, vol. 64, no. 4, pp. 223-229, 2025
* 6 pages, 3 figures

Via

Access Paper or Ask Questions