Title: Fair Exploration via Axiomatic Bargaining

Jun 04, 2021

Jackie Baek, Vivek F. Farias

Abstract: Motivated by the consideration of fairly sharing the cost of exploration between multiple groups in learning problems, we develop the Nash bargaining solution in the context of multi-armed bandits. Specifically, the 'grouped' bandit associated with any multi-armed bandit problem associates, with each time step, a single group from some finite set of groups. The utility gained by a given group under some learning policy is naturally viewed as the reduction in that group's regret relative to the regret that group would have incurred 'on its own'. We derive policies that yield the Nash bargaining solution relative to the set of incremental utilities possible under any policy. We show that on the one hand, the 'price of fairness' under such policies is limited, while on the other hand, regret optimal policies are arbitrarily unfair under generic conditions. Our theoretical development is complemented by a case study on contextual bandits for warfarin dosing where we are concerned with the cost of exploration across multiple races and age groups.
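As an illustration of the utility notion in the abstract, the sketch below treats each group's utility as its standalone regret minus its regret under a shared policy, and picks the candidate with the largest Nash product (equivalently, the largest sum of log-utilities). This is a minimal sketch, not the paper's algorithm: the function name `nash_bargaining_choice`, the restriction to a finite list of candidate policies, and all regret numbers are hypothetical and chosen only for illustration.

```python
import math


def nash_bargaining_choice(standalone_regret, candidate_regrets):
    """Return the name of the candidate policy with the largest Nash product.

    standalone_regret: dict mapping group -> regret the group incurs on its own.
    candidate_regrets: dict mapping policy name -> {group: regret under that policy}.
    Both inputs are hypothetical illustrations, not data from the paper.
    """
    best_name, best_score = None, -math.inf
    for name, regrets in candidate_regrets.items():
        # Utility = reduction in regret relative to learning alone.
        utilities = [standalone_regret[g] - regrets[g] for g in standalone_regret]
        # A policy that leaves some group no better off has a non-positive
        # Nash product, so it is skipped here.
        if min(utilities) <= 0:
            continue
        # Maximizing the sum of logs is equivalent to maximizing the product.
        score = sum(math.log(u) for u in utilities)
        if score > best_score:
            best_name, best_score = name, score
    return best_name


# Made-up numbers: two groups, two candidate policies.
standalone = {"A": 10.0, "B": 10.0}
candidates = {
    # Lowest total regret, but group B gains almost nothing.
    "regret_optimal": {"A": 1.0, "B": 9.9},
    # Slightly higher total regret, gains shared more evenly.
    "balanced": {"A": 5.0, "B": 6.0},
}

choice = nash_bargaining_choice(standalone, candidates)
lowest_total = min(candidates, key=lambda n: sum(candidates[n].values()))
print(choice, lowest_total)
```

With these made-up numbers, the policy with the lowest total regret gives nearly all of the benefit to one group, while the Nash product favors the policy that shares the gains, at a small cost in total regret.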

View paper on OpenReview