* 31 pages, 3 figures, extended abstract in the proceedings of
RLDM2017. (v2 revisions: Fixed a minor bug in the code w.r.t. setting seed,
as a result numbers in the simulation experiments had some slight changes,
but conclusions stayed the same. Corrected typos. Improved notations.) Access Paper or Ask Questions