Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Boosted Prompt Ensembles for Large Language Models


Apr 12, 2023
Silviu Pitis, Michael R. Zhang, Andrew Wang, Jimmy Ba

Add code


   Access Paper or Ask Questions

Large Language Models Are Human-Level Prompt Engineers


Nov 03, 2022
Yongchao Zhou, Andrei Ioan Muresanu, Ziwen Han, Keiran Paster, Silviu Pitis, Harris Chan, Jimmy Ba

Add code


   Access Paper or Ask Questions

MoCoDA: Model-based Counterfactual Data Augmentation


Oct 20, 2022
Silviu Pitis, Elliot Creager, Ajay Mandlekar, Animesh Garg

Add code

* In Proceedings of NeurIPS 2022. 10 pages (+3 references, +10 appendix). Code available at https://github.com/spitis/mocoda 

   Access Paper or Ask Questions

Counterfactual Data Augmentation using Locally Factored Dynamics


Jul 06, 2020
Silviu Pitis, Elliot Creager, Animesh Garg

Add code

* 12 pages (+12 appendix). Code available at \url{https://github.com/spitis/mrl

   Access Paper or Ask Questions

Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning


Jul 06, 2020
Silviu Pitis, Harris Chan, Stephen Zhao, Bradly Stadie, Jimmy Ba

Add code

* 12 pages (+12 appendix). Published as a conference paper at ICML 2020. Code available at https://github.com/spitis/mrl 

   Access Paper or Ask Questions

An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality


Feb 14, 2020
Silviu Pitis, Harris Chan, Kiarash Jamali, Jimmy Ba

Add code

* 11 pages (+18 appendix). Published as a conference paper at ICLR 2020. https://openreview.net/forum?id=HJeiDpVFPr 

   Access Paper or Ask Questions

Objective Social Choice: Using Auxiliary Information to Improve Voting Outcomes


Jan 27, 2020
Silviu Pitis, Michael R. Zhang

Add code

* 10 pages, 3 figures. To appear in proceedings of AAMAS 2020 

   Access Paper or Ask Questions

Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning


Sep 09, 2019
Kristopher De Asis, Alan Chan, Silviu Pitis, Richard S. Sutton, Daniel Graves

Add code


   Access Paper or Ask Questions

Source Traces for Temporal Difference Learning


Feb 08, 2019
Silviu Pitis

Add code

* 8 pages. In proceedings of AAAI 2018. Slides and bibtex available at https://silviupitis.com/#source-traces-for-temporal-difference-learning 

   Access Paper or Ask Questions

Rethinking the Discount Factor in Reinforcement Learning: A Decision Theoretic Approach


Feb 08, 2019
Silviu Pitis

Add code

* 8 pages + 1 page supplement. In proceedings of AAAI 2019. Slides, poster and bibtex available at https://silviupitis.com/#rethinking-the-discount-factor-in-reinforcement-learning-a-decision-theoretic-approach 

   Access Paper or Ask Questions