* Published in UAI 2016. We have made the following change in this
revision: instead of expressing convergence rate results in terms of the
iterate difference, we state them in terms of the iterate distance divided by
the step size (a measure of first-order optimality). We also removed some
claims about the performance with a fixed step size.