Picture for Durgesh Kalwar

Durgesh Kalwar

RL in Name Only? Analyzing the Structural Assumptions in RL post-training for LLMs

Add code
May 19, 2025
Viaarxiv icon

Using General Value Functions to Learn Domain-Backed Inventory Management Policies

Add code
Nov 03, 2023
Viaarxiv icon

Safe Sequential Optimization for Switching Environments

Add code
Nov 03, 2023
Viaarxiv icon

Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning

Add code
Mar 02, 2022
Figure 1 for Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning
Figure 2 for Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning
Figure 3 for Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning
Figure 4 for Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning
Viaarxiv icon