Picture for Cosmin Paduraru

Cosmin Paduraru

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Towards practical reinforcement learning for tokamak magnetic control

Add code
Jul 21, 2023
Figure 1 for Towards practical reinforcement learning for tokamak magnetic control
Figure 2 for Towards practical reinforcement learning for tokamak magnetic control
Figure 3 for Towards practical reinforcement learning for tokamak magnetic control
Figure 4 for Towards practical reinforcement learning for tokamak magnetic control
Viaarxiv icon

Optimizing Memory Mapping Using Deep Reinforcement Learning

Add code
May 11, 2023
Figure 1 for Optimizing Memory Mapping Using Deep Reinforcement Learning
Figure 2 for Optimizing Memory Mapping Using Deep Reinforcement Learning
Figure 3 for Optimizing Memory Mapping Using Deep Reinforcement Learning
Figure 4 for Optimizing Memory Mapping Using Deep Reinforcement Learning
Viaarxiv icon

Transformers Meet Directed Graphs

Add code
Jan 31, 2023
Figure 1 for Transformers Meet Directed Graphs
Figure 2 for Transformers Meet Directed Graphs
Figure 3 for Transformers Meet Directed Graphs
Figure 4 for Transformers Meet Directed Graphs
Viaarxiv icon

Controlling Commercial Cooling Systems Using Reinforcement Learning

Add code
Nov 11, 2022
Figure 1 for Controlling Commercial Cooling Systems Using Reinforcement Learning
Figure 2 for Controlling Commercial Cooling Systems Using Reinforcement Learning
Figure 3 for Controlling Commercial Cooling Systems Using Reinforcement Learning
Figure 4 for Controlling Commercial Cooling Systems Using Reinforcement Learning
Viaarxiv icon

Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning

Add code
Sep 16, 2022
Figure 1 for Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning
Figure 2 for Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning
Figure 3 for Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning
Figure 4 for Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning
Viaarxiv icon

Semi-analytical Industrial Cooling System Model for Reinforcement Learning

Add code
Jul 26, 2022
Figure 1 for Semi-analytical Industrial Cooling System Model for Reinforcement Learning
Figure 2 for Semi-analytical Industrial Cooling System Model for Reinforcement Learning
Figure 3 for Semi-analytical Industrial Cooling System Model for Reinforcement Learning
Figure 4 for Semi-analytical Industrial Cooling System Model for Reinforcement Learning
Viaarxiv icon

COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation

Add code
Apr 19, 2022
Figure 1 for COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
Figure 2 for COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
Figure 3 for COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
Figure 4 for COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
Viaarxiv icon

Active Offline Policy Selection

Add code
Jun 18, 2021
Figure 1 for Active Offline Policy Selection
Figure 2 for Active Offline Policy Selection
Figure 3 for Active Offline Policy Selection
Figure 4 for Active Offline Policy Selection
Viaarxiv icon

Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization

Add code
Apr 28, 2021
Figure 1 for Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Figure 2 for Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Figure 3 for Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Figure 4 for Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Viaarxiv icon