Raghuram Bharadwaj Diddigi

Full-Gradient Successor Feature Representations

Apr 01, 2026

Generalisation in Multitask Fitted Q-Iteration and Offline Q-learning

Dec 23, 2025

Image Generation from Image Captioning -- Invertible Approach

Oct 26, 2024

Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm

Oct 19, 2021

Attention Actor-Critic algorithm for Multi-Agent Constrained Co-operative Reinforcement Learning

Jan 07, 2021

A Convergent Off-Policy Temporal Difference Algorithm

Nov 13, 2019

Solution of Two-Player Zero-Sum Game by Successive Relaxation

Jun 16, 2019

Second Order Value Iteration in Reinforcement Learning

May 10, 2019

Successive Over Relaxation Q-Learning

Mar 15, 2019

An Online Sample Based Method for Mode Estimation using ODE Analysis of Stochastic Approximation Algorithms

Feb 11, 2019