Sebastien Gros

Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies

May 22, 2025

Differentiable Nonlinear Model Predictive Control

May 02, 2025

Synthesis of Model Predictive Control and Reinforcement Learning: Survey and Classification

Feb 04, 2025

All AI Models are Wrong, but Some are Optimal

Jan 10, 2025

Application of Soft Actor-Critic Algorithms in Optimizing Wastewater Treatment with Time Delays Integration

Nov 27, 2024

Flipping-based Policy for Chance-Constrained Markov Decision Processes

Oct 09, 2024

Battery Capacity Knee Identification Using Unsupervised Time Series Segmentation

Apr 23, 2023

Deep active learning for nonlinear system identification

Feb 24, 2023

Learning-based MPC from Big Data Using Reinforcement Learning

Jan 04, 2023

Bridging the gap between QP-based and MPC-based RL

May 18, 2022