Title: Learning Policies for Contextual Submodular Prediction

May 11, 2013

Stephane Ross, Jiaji Zhou, Yisong Yue, Debadeepta Dey, J. Andrew Bagnell

Figure 1 for Learning Policies for Contextual Submodular Prediction

Figure 2 for Learning Policies for Contextual Submodular Prediction


Abstract: Many prediction domains, such as ad placement, recommendation, trajectory prediction, and document summarization, require predicting a set or list of options. Such lists are often evaluated using submodular reward functions that measure both quality and diversity. We propose a simple, efficient, and provably near-optimal approach to optimizing such prediction problems based on no-regret learning. Our method leverages a surprising result from online submodular optimization: a single no-regret online learner can compete with an optimal sequence of predictions. Compared to previous work, which either learns a sequence of classifiers or relies on stronger assumptions such as realizability, we ensure both data efficiency and performance guarantees in the fully agnostic setting. Experiments validate the efficiency and applicability of the approach on a wide range of problems, including manipulator trajectory optimization, news recommendation, and document summarization.

* 13 pages. To appear in proceedings of the International Conference on Machine Learning (ICML), 2013
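
To make concrete what the abstract means by a submodular reward that measures both quality and diversity, here is a minimal sketch using topic coverage as the reward and the classical greedy rule to build a list. This is only an illustration with known rewards, not the contextual policy-learning method the paper proposes; the article names, topic sets, and the functions `coverage_reward` and `greedy_list` are hypothetical.

```python
from typing import Dict, FrozenSet, List, Sequence

# Hypothetical toy data: each article covers a set of topics.
ARTICLES: Dict[str, FrozenSet[str]] = {
    "election_recap": frozenset({"politics", "economy", "polls"}),
    "election_analysis": frozenset({"politics", "economy", "polls"}),
    "cup_final": frozenset({"sports"}),
    "chip_launch": frozenset({"tech", "economy"}),
}


def coverage_reward(chosen: Sequence[str], topics: Dict[str, FrozenSet[str]]) -> int:
    """Count distinct topics covered by the list (a monotone submodular reward)."""
    covered = set()
    for item in chosen:
        covered |= topics[item]
    return len(covered)


def greedy_list(topics: Dict[str, FrozenSet[str]], budget: int) -> List[str]:
    """Build a list by repeatedly adding the item with the largest marginal gain."""
    chosen: List[str] = []
    remaining = sorted(topics)  # sorted so that ties break deterministically
    for _ in range(min(budget, len(remaining))):
        base = coverage_reward(chosen, topics)
        best = max(
            remaining,
            key=lambda item: coverage_reward(chosen + [item], topics) - base,
        )
        chosen.append(best)
        remaining.remove(best)
    return chosen


if __name__ == "__main__":
    picks = greedy_list(ARTICLES, budget=2)
    print(picks, coverage_reward(picks, ARTICLES))
```

In this toy data the two election articles each cover three topics but overlap completely, so listing both yields a reward of 3, while the greedy list pairs one of them with a non-redundant article and covers more. That diminishing return from redundant items is the property a submodular reward captures.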
