Alert button
Picture for Giovanni Montana

Giovanni Montana

Alert button

REValueD: Regularised Ensemble Value-Decomposition for Factorisable Markov Decision Processes

Add code
Bookmark button
Alert button
Jan 16, 2024
David Ireland, Giovanni Montana

Viaarxiv icon

Generating a Graph Colouring Heuristic with Deep Q-Learning and Graph Neural Networks

Add code
Bookmark button
Alert button
Apr 08, 2023
George Watkins, Giovanni Montana, Juergen Branke

Figure 1 for Generating a Graph Colouring Heuristic with Deep Q-Learning and Graph Neural Networks
Figure 2 for Generating a Graph Colouring Heuristic with Deep Q-Learning and Graph Neural Networks
Figure 3 for Generating a Graph Colouring Heuristic with Deep Q-Learning and Graph Neural Networks
Figure 4 for Generating a Graph Colouring Heuristic with Deep Q-Learning and Graph Neural Networks
Viaarxiv icon

Balancing policy constraint and ensemble size in uncertainty-based offline reinforcement learning

Add code
Bookmark button
Alert button
Mar 26, 2023
Alex Beeson, Giovanni Montana

Figure 1 for Balancing policy constraint and ensemble size in uncertainty-based offline reinforcement learning
Figure 2 for Balancing policy constraint and ensemble size in uncertainty-based offline reinforcement learning
Figure 3 for Balancing policy constraint and ensemble size in uncertainty-based offline reinforcement learning
Figure 4 for Balancing policy constraint and ensemble size in uncertainty-based offline reinforcement learning
Viaarxiv icon

Goal-conditioned Offline Reinforcement Learning through State Space Partitioning

Add code
Bookmark button
Alert button
Mar 16, 2023
Mianchu Wang, Yue Jin, Giovanni Montana

Figure 1 for Goal-conditioned Offline Reinforcement Learning through State Space Partitioning
Figure 2 for Goal-conditioned Offline Reinforcement Learning through State Space Partitioning
Figure 3 for Goal-conditioned Offline Reinforcement Learning through State Space Partitioning
Figure 4 for Goal-conditioned Offline Reinforcement Learning through State Space Partitioning
Viaarxiv icon

Model-based trajectory stitching for improved behavioural cloning and its applications

Add code
Bookmark button
Alert button
Dec 08, 2022
Charles A. Hepburn, Giovanni Montana

Figure 1 for Model-based trajectory stitching for improved behavioural cloning and its applications
Figure 2 for Model-based trajectory stitching for improved behavioural cloning and its applications
Figure 3 for Model-based trajectory stitching for improved behavioural cloning and its applications
Figure 4 for Model-based trajectory stitching for improved behavioural cloning and its applications
Viaarxiv icon

Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning

Add code
Bookmark button
Alert button
Nov 21, 2022
Alex Beeson, Giovanni Montana

Figure 1 for Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning
Figure 2 for Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning
Figure 3 for Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning
Figure 4 for Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning
Viaarxiv icon

Model-based Trajectory Stitching for Improved Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 21, 2022
Charles A. Hepburn, Giovanni Montana

Figure 1 for Model-based Trajectory Stitching for Improved Offline Reinforcement Learning
Figure 2 for Model-based Trajectory Stitching for Improved Offline Reinforcement Learning
Figure 3 for Model-based Trajectory Stitching for Improved Offline Reinforcement Learning
Figure 4 for Model-based Trajectory Stitching for Improved Offline Reinforcement Learning
Viaarxiv icon

Assessing the Performance of Automated Prediction and Ranking of Patient Age from Chest X-rays Against Clinicians

Add code
Bookmark button
Alert button
Jul 04, 2022
Matthew MacPherson, Keerthini Muthuswamy, Ashik Amlani, Charles Hutchinson, Vicky Goh, Giovanni Montana

Figure 1 for Assessing the Performance of Automated Prediction and Ranking of Patient Age from Chest X-rays Against Clinicians
Figure 2 for Assessing the Performance of Automated Prediction and Ranking of Patient Age from Chest X-rays Against Clinicians
Figure 3 for Assessing the Performance of Automated Prediction and Ranking of Patient Age from Chest X-rays Against Clinicians
Figure 4 for Assessing the Performance of Automated Prediction and Ranking of Patient Age from Chest X-rays Against Clinicians
Viaarxiv icon

LeNSE: Learning To Navigate Subgraph Embeddings for Large-Scale Combinatorial Optimisation

Add code
Bookmark button
Alert button
May 20, 2022
David Ireland, Giovanni Montana

Figure 1 for LeNSE: Learning To Navigate Subgraph Embeddings for Large-Scale Combinatorial Optimisation
Figure 2 for LeNSE: Learning To Navigate Subgraph Embeddings for Large-Scale Combinatorial Optimisation
Figure 3 for LeNSE: Learning To Navigate Subgraph Embeddings for Large-Scale Combinatorial Optimisation
Figure 4 for LeNSE: Learning To Navigate Subgraph Embeddings for Large-Scale Combinatorial Optimisation
Viaarxiv icon

A persistent homology-based topological loss for CNN-based multi-class segmentation of CMR

Add code
Bookmark button
Alert button
Jul 27, 2021
Nick Byrne, James R Clough, Isra Valverde, Giovanni Montana, Andrew P King

Figure 1 for A persistent homology-based topological loss for CNN-based multi-class segmentation of CMR
Figure 2 for A persistent homology-based topological loss for CNN-based multi-class segmentation of CMR
Figure 3 for A persistent homology-based topological loss for CNN-based multi-class segmentation of CMR
Figure 4 for A persistent homology-based topological loss for CNN-based multi-class segmentation of CMR
Viaarxiv icon