Picture for Prakash Panangaden

Prakash Panangaden

McGill University

Conditions on Preference Relations that Guarantee the Existence of Optimal Policies

Nov 03, 2023
Viaarxiv icon

Policy Gradient Methods in the Presence of Symmetries and State Abstractions

Add code
May 09, 2023
Figure 1 for Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Figure 2 for Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Figure 3 for Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Figure 4 for Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Viaarxiv icon

Continuous MDP Homomorphisms and Homomorphic Policy Gradient

Add code
Sep 15, 2022
Figure 1 for Continuous MDP Homomorphisms and Homomorphic Policy Gradient
Figure 2 for Continuous MDP Homomorphisms and Homomorphic Policy Gradient
Figure 3 for Continuous MDP Homomorphisms and Homomorphic Policy Gradient
Figure 4 for Continuous MDP Homomorphisms and Homomorphic Policy Gradient
Viaarxiv icon

Riemannian Diffusion Models

Aug 16, 2022
Figure 1 for Riemannian Diffusion Models
Figure 2 for Riemannian Diffusion Models
Figure 3 for Riemannian Diffusion Models
Figure 4 for Riemannian Diffusion Models
Viaarxiv icon

Extracting Weighted Automata for Approximate Minimization in Language Modelling

Jun 05, 2021
Viaarxiv icon

MICo: Learning improved representations via sampling-based state similarity for Markov decision processes

Add code
Jun 03, 2021
Figure 1 for MICo: Learning improved representations via sampling-based state similarity for Markov decision processes
Figure 2 for MICo: Learning improved representations via sampling-based state similarity for Markov decision processes
Figure 3 for MICo: Learning improved representations via sampling-based state similarity for Markov decision processes
Figure 4 for MICo: Learning improved representations via sampling-based state similarity for Markov decision processes
Viaarxiv icon

A Study of Policy Gradient on a Class of Exactly Solvable Models

Nov 03, 2020
Figure 1 for A Study of Policy Gradient on a Class of Exactly Solvable Models
Figure 2 for A Study of Policy Gradient on a Class of Exactly Solvable Models
Figure 3 for A Study of Policy Gradient on a Class of Exactly Solvable Models
Figure 4 for A Study of Policy Gradient on a Class of Exactly Solvable Models
Viaarxiv icon

A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms

Mar 27, 2020
Figure 1 for A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms
Viaarxiv icon

Latent Variable Modelling with Hyperbolic Normalizing Flows

Add code
Feb 18, 2020
Figure 1 for Latent Variable Modelling with Hyperbolic Normalizing Flows
Figure 2 for Latent Variable Modelling with Hyperbolic Normalizing Flows
Figure 3 for Latent Variable Modelling with Hyperbolic Normalizing Flows
Figure 4 for Latent Variable Modelling with Hyperbolic Normalizing Flows
Viaarxiv icon

Proceedings of the 11th workshop on Quantum Physics and Logic

Dec 28, 2014
Viaarxiv icon