Picture for Kishan Panaganti

Kishan Panaganti

Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees

Add code
May 25, 2025
Viaarxiv icon

KL-regularization Itself is Differentially Private in Bandits and RLHF

Add code
May 23, 2025
Viaarxiv icon

Distributionally Robust Direct Preference Optimization

Add code
Feb 04, 2025
Figure 1 for Distributionally Robust Direct Preference Optimization
Figure 2 for Distributionally Robust Direct Preference Optimization
Figure 3 for Distributionally Robust Direct Preference Optimization
Figure 4 for Distributionally Robust Direct Preference Optimization
Viaarxiv icon

Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data

Add code
Nov 06, 2024
Figure 1 for Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data
Figure 2 for Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data
Figure 3 for Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data
Figure 4 for Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data
Viaarxiv icon

Distributionally Robust Constrained Reinforcement Learning under Strong Duality

Add code
Jun 22, 2024
Figure 1 for Distributionally Robust Constrained Reinforcement Learning under Strong Duality
Figure 2 for Distributionally Robust Constrained Reinforcement Learning under Strong Duality
Figure 3 for Distributionally Robust Constrained Reinforcement Learning under Strong Duality
Viaarxiv icon

Tractable Equilibrium Computation in Markov Games through Risk Aversion

Add code
Jun 20, 2024
Viaarxiv icon

Model-Free Robust $φ$-Divergence Reinforcement Learning Using Both Offline and Online Data

Add code
May 08, 2024
Figure 1 for Model-Free Robust $φ$-Divergence Reinforcement Learning Using Both Offline and Online Data
Viaarxiv icon

Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage

Add code
Oct 27, 2023
Figure 1 for Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage
Figure 2 for Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage
Figure 3 for Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage
Figure 4 for Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage
Viaarxiv icon

Improved Sample Complexity Bounds for Distributionally Robust Reinforcement Learning

Add code
Mar 05, 2023
Figure 1 for Improved Sample Complexity Bounds for Distributionally Robust Reinforcement Learning
Figure 2 for Improved Sample Complexity Bounds for Distributionally Robust Reinforcement Learning
Figure 3 for Improved Sample Complexity Bounds for Distributionally Robust Reinforcement Learning
Viaarxiv icon

Personalized Reward Learning with Interaction-Grounded Learning (IGL)

Add code
Nov 28, 2022
Figure 1 for Personalized Reward Learning with Interaction-Grounded Learning (IGL)
Figure 2 for Personalized Reward Learning with Interaction-Grounded Learning (IGL)
Figure 3 for Personalized Reward Learning with Interaction-Grounded Learning (IGL)
Figure 4 for Personalized Reward Learning with Interaction-Grounded Learning (IGL)
Viaarxiv icon