Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

PAC Mode Estimation using PPR Martingale Confidence Sequences



Shubham Anand Jain , Sanit Gupta , Denil Mehta , Inderjeet Jayakumar Nair , Rohan Shah , Jian Vora , Sushil Khyalia , Sourav Das , Vinay J. Ribeiro , Shivaram Kalyanakrishnan

* 30 pages, 2 figures 

   Access Paper or Ask Questions

An Analysis of Frame-skipping in Reinforcement Learning



Shivaram Kalyanakrishnan , Siddharth Aravindan , Vishwajeet Bagdawat , Varun Bhatt , Harshith Goka , Archit Gupta , Kalpesh Krishna , Vihari Piratla


   Access Paper or Ask Questions

Lower Bounds for Policy Iteration on Multi-action MDPs



Kumar Ashutosh , Sarthak Consul , Bhishma Dedhia , Parthasarathi Khirwadkar , Sahil Shah , Shivaram Kalyanakrishnan

* 8 pages, 3 diagrams, 2 tables. Paper in IEEE CDC 2020 

   Access Paper or Ask Questions

Regret Minimisation in Multi-Armed Bandits Using Bounded Arm Memory



Arghya Roy Chaudhuri , Shivaram Kalyanakrishnan


   Access Paper or Ask Questions

PAC Identification of Many Good Arms in Stochastic Multi-Armed Bandits



Arghya Roy Chaudhuri , Shivaram Kalyanakrishnan


   Access Paper or Ask Questions