Picture for Ilija Bogunovic

Ilija Bogunovic

Overton Pluralistic Reinforcement Learning for Large Language Models

Add code
Feb 24, 2026
Viaarxiv icon

LLM-WikiRace: Benchmarking Long-term Planning and Reasoning over Real-World Knowledge Graphs

Add code
Feb 18, 2026
Viaarxiv icon

Multi-Task GRPO: Reliable LLM Reasoning Across Tasks

Add code
Feb 05, 2026
Viaarxiv icon

Robust Bayesian Optimisation with Unbounded Corruptions

Add code
Nov 19, 2025
Viaarxiv icon

Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data

Add code
Aug 17, 2025
Figure 1 for Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data
Figure 2 for Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data
Figure 3 for Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data
Figure 4 for Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data
Viaarxiv icon

Robust Multi-Objective Controlled Decoding of Large Language Models

Add code
Mar 11, 2025
Figure 1 for Robust Multi-Objective Controlled Decoding of Large Language Models
Figure 2 for Robust Multi-Objective Controlled Decoding of Large Language Models
Figure 3 for Robust Multi-Objective Controlled Decoding of Large Language Models
Figure 4 for Robust Multi-Objective Controlled Decoding of Large Language Models
Viaarxiv icon

This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs

Add code
Mar 07, 2025
Viaarxiv icon

Mean-Field Bayesian Optimisation

Add code
Feb 17, 2025
Figure 1 for Mean-Field Bayesian Optimisation
Figure 2 for Mean-Field Bayesian Optimisation
Figure 3 for Mean-Field Bayesian Optimisation
Figure 4 for Mean-Field Bayesian Optimisation
Viaarxiv icon

Almost Surely Safe Alignment of Large Language Models at Inference-Time

Add code
Feb 03, 2025
Viaarxiv icon

No-Regret Linear Bandits under Gap-Adjusted Misspecification

Add code
Jan 09, 2025
Figure 1 for No-Regret Linear Bandits under Gap-Adjusted Misspecification
Figure 2 for No-Regret Linear Bandits under Gap-Adjusted Misspecification
Figure 3 for No-Regret Linear Bandits under Gap-Adjusted Misspecification
Viaarxiv icon