Picture for Pradeep Varakantham

Pradeep Varakantham

UNIQ: Offline Inverse Q-learning for Avoiding Undesirable Demonstrations

Add code
Oct 10, 2024
Viaarxiv icon

Towards Neural Network based Cognitive Models of Dynamic Decision-Making by Humans

Add code
Jul 24, 2024
Viaarxiv icon

Preserving the Privacy of Reward Functions in MDPs through Deception

Add code
Jul 13, 2024
Viaarxiv icon

Safety through feedback in Constrained RL

Add code
Jun 28, 2024
Viaarxiv icon

EduQate: Generating Adaptive Curricula through RMABs in Education Settings

Add code
Jun 20, 2024
Viaarxiv icon

Unlocking Large Language Model's Planning Capabilities with Maximum Diversity Fine-tuning

Add code
Jun 15, 2024
Viaarxiv icon

Bootstrapping Language Models with DPO Implicit Rewards

Add code
Jun 14, 2024
Viaarxiv icon

Probabilistic Perspectives on Error Minimization in Adversarial Reinforcement Learning

Add code
Jun 07, 2024
Viaarxiv icon

Imitating Cost-Constrained Behaviors in Reinforcement Learning

Add code
Mar 27, 2024
Viaarxiv icon

SubIQ: Inverse Soft-Q Learning for Offline Imitation with Suboptimal Demonstrations

Add code
Feb 20, 2024
Viaarxiv icon