Picture for Goran Radanović

Goran Radanović

Can In-Context Reinforcement Learning Recover From Reward Poisoning Attacks?

Add code
Jun 07, 2025
Viaarxiv icon

Independent Learning in Performative Markov Potential Games

Add code
Apr 29, 2025
Viaarxiv icon

Policy Teaching via Data Poisoning in Learning from Human Preferences

Add code
Mar 13, 2025
Viaarxiv icon

Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints

Add code
Jan 14, 2025
Viaarxiv icon

Corruption-Robust Offline Two-Player Zero-Sum Markov Games

Add code
Mar 04, 2024
Viaarxiv icon

Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences

Add code
Mar 04, 2024
Viaarxiv icon

Corruption Robust Offline Reinforcement Learning with Human Feedback

Add code
Feb 09, 2024
Viaarxiv icon