Picture for Stefano V. Albrecht

Stefano V. Albrecht

Fairness over Equality: Correcting Social Incentives in Asymmetric Sequential Social Dilemmas

Add code
Feb 17, 2026
Viaarxiv icon

Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour

Add code
May 23, 2025
Figure 1 for Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour
Figure 2 for Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour
Figure 3 for Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour
Figure 4 for Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour
Viaarxiv icon

HAD-Gen: Human-like and Diverse Driving Behavior Modeling for Controllable Scenario Generation

Add code
Mar 19, 2025
Figure 1 for HAD-Gen: Human-like and Diverse Driving Behavior Modeling for Controllable Scenario Generation
Figure 2 for HAD-Gen: Human-like and Diverse Driving Behavior Modeling for Controllable Scenario Generation
Figure 3 for HAD-Gen: Human-like and Diverse Driving Behavior Modeling for Controllable Scenario Generation
Figure 4 for HAD-Gen: Human-like and Diverse Driving Behavior Modeling for Controllable Scenario Generation
Viaarxiv icon

Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning

Add code
Mar 08, 2025
Figure 1 for Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
Figure 2 for Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
Figure 3 for Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
Figure 4 for Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
Viaarxiv icon

Agent-Temporal Credit Assignment for Optimal Policy Preservation in Sparse Multi-Agent Reinforcement Learning

Add code
Dec 19, 2024
Figure 1 for Agent-Temporal Credit Assignment for Optimal Policy Preservation in Sparse Multi-Agent Reinforcement Learning
Viaarxiv icon

HyperMARL: Adaptive Hypernetworks for Multi-Agent RL

Add code
Dec 05, 2024
Figure 1 for HyperMARL: Adaptive Hypernetworks for Multi-Agent RL
Figure 2 for HyperMARL: Adaptive Hypernetworks for Multi-Agent RL
Figure 3 for HyperMARL: Adaptive Hypernetworks for Multi-Agent RL
Figure 4 for HyperMARL: Adaptive Hypernetworks for Multi-Agent RL
Viaarxiv icon

Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning

Add code
Jun 07, 2024
Figure 1 for Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning
Figure 2 for Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning
Figure 3 for Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning
Figure 4 for Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning
Viaarxiv icon

Highway Graph to Accelerate Reinforcement Learning

Add code
May 20, 2024
Figure 1 for Highway Graph to Accelerate Reinforcement Learning
Figure 2 for Highway Graph to Accelerate Reinforcement Learning
Figure 3 for Highway Graph to Accelerate Reinforcement Learning
Figure 4 for Highway Graph to Accelerate Reinforcement Learning
Viaarxiv icon

Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras

Add code
Apr 22, 2024
Figure 1 for Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras
Figure 2 for Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras
Figure 3 for Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras
Figure 4 for Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras
Viaarxiv icon

LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots

Add code
Apr 22, 2024
Figure 1 for LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots
Figure 2 for LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots
Figure 3 for LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots
Figure 4 for LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots
Viaarxiv icon