Picture for Goran Radanović

Goran Radanović

Can In-Context Reinforcement Learning Recover From Reward Poisoning Attacks?

Add code
Jun 07, 2025
Viaarxiv icon

Independent Learning in Performative Markov Potential Games

Add code
Apr 29, 2025
Figure 1 for Independent Learning in Performative Markov Potential Games
Figure 2 for Independent Learning in Performative Markov Potential Games
Figure 3 for Independent Learning in Performative Markov Potential Games
Figure 4 for Independent Learning in Performative Markov Potential Games
Viaarxiv icon

Policy Teaching via Data Poisoning in Learning from Human Preferences

Add code
Mar 13, 2025
Viaarxiv icon

Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints

Add code
Jan 14, 2025
Viaarxiv icon

Corruption-Robust Offline Two-Player Zero-Sum Markov Games

Add code
Mar 04, 2024
Viaarxiv icon

Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences

Add code
Mar 04, 2024
Viaarxiv icon

Corruption Robust Offline Reinforcement Learning with Human Feedback

Add code
Feb 09, 2024
Viaarxiv icon