Picture for Pradeep Varakantham

Pradeep Varakantham

Safety through feedback in Constrained RL

Add code
Jun 28, 2024
Viaarxiv icon

EduQate: Generating Adaptive Curricula through RMABs in Education Settings

Add code
Jun 20, 2024
Viaarxiv icon

Unlocking Large Language Model's Planning Capabilities with Maximum Diversity Fine-tuning

Add code
Jun 15, 2024
Viaarxiv icon

Bootstrapping Language Models with DPO Implicit Rewards

Add code
Jun 14, 2024
Viaarxiv icon

Probabilistic Perspectives on Error Minimization in Adversarial Reinforcement Learning

Add code
Jun 07, 2024
Viaarxiv icon

Imitating Cost-Constrained Behaviors in Reinforcement Learning

Add code
Mar 27, 2024
Figure 1 for Imitating Cost-Constrained Behaviors in Reinforcement Learning
Figure 2 for Imitating Cost-Constrained Behaviors in Reinforcement Learning
Figure 3 for Imitating Cost-Constrained Behaviors in Reinforcement Learning
Figure 4 for Imitating Cost-Constrained Behaviors in Reinforcement Learning
Viaarxiv icon

SubIQ: Inverse Soft-Q Learning for Offline Imitation with Suboptimal Demonstrations

Add code
Feb 20, 2024
Viaarxiv icon

Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning

Add code
Dec 26, 2023
Viaarxiv icon

Training Reinforcement Learning Agents and Humans With Difficulty-Conditioned Generators

Add code
Dec 04, 2023
Viaarxiv icon

Generative Modelling of Stochastic Actions with Arbitrary Constraints in Reinforcement Learning

Add code
Nov 26, 2023
Viaarxiv icon