PaLM: Scaling Language Modeling with Pathways



Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, Michael Isard, Guy Gur-Ari, Pengcheng Yin, Toju Duke, Anselm Levskaya, Sanjay Ghemawat, Sunipa Dev, Henryk Michalewski, Xavier Garcia, Vedant Misra, Kevin Robinson, Liam Fedus, Denny Zhou, Daphne Ippolito, David Luan, Hyeontaek Lim, Barret Zoph, Alexander Spiridonov, Ryan Sepassi, David Dohan, Shivani Agrawal, Mark Omernick, Andrew M. Dai, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Rewon Child, Oleksandr Polozov, Katherine Lee, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Mark Diaz, Orhan Firat, Michele Catasta, Jason Wei, Kathy Meier-Hellstern, Douglas Eck, Jeff Dean, Slav Petrov, Noah Fiedel



Self-Consistency Improves Chain of Thought Reasoning in Language Models



Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc Le, Ed Chi, Sharan Narang, Aakanksha Chowdhery, Denny Zhou

* V2: added PaLM-based results


Scaling Up Models and Data with $\texttt{t5x}$ and $\texttt{seqio}$



Adam Roberts, Hyung Won Chung, Anselm Levskaya, Gaurav Mishra, James Bradbury, Daniel Andor, Sharan Narang, Brian Lester, Colin Gaffney, Afroz Mohiuddin, Curtis Hawthorne, Aitor Lewkowycz, Alex Salcianu, Marc van Zee, Jacob Austin, Sebastian Goodman, Livio Baldini Soares, Haitang Hu, Sasha Tsvyashchenko, Aakanksha Chowdhery, Jasmijn Bastings, Jannis Bulian, Xavier Garcia, Jianmo Ni, Andrew Chen, Kathleen Kenealy, Jonathan H. Clark, Stephan Lee, Dan Garrette, James Lee-Thorp, Colin Raffel, Noam Shazeer, Marvin Ritter, Maarten Bosma, Alexandre Passos, Jeremy Maitin-Shepard, Noah Fiedel, Mark Omernick, Brennan Saeta, Ryan Sepassi, Alexander Spiridonov, Joshua Newlan, Andrea Gesmundo



Pathways: Asynchronous Distributed Dataflow for ML



Paul Barham, Aakanksha Chowdhery, Jeff Dean, Sanjay Ghemawat, Steven Hand, Dan Hurt, Michael Isard, Hyeontaek Lim, Ruoming Pang, Sudip Roy, Brennan Saeta, Parker Schuh, Ryan Sepassi, Laurent El Shafey, Chandramohan A. Thekkath, Yonghui Wu

* MLSys 2022 


Sparse is Enough in Scaling Transformers



Sebastian Jaszczur, Aakanksha Chowdhery, Afroz Mohiuddin, Łukasz Kaiser, Wojciech Gajewski, Henryk Michalewski, Jonni Kanerva

* NeurIPS 2021 


DSelect-k: Differentiable Selection in the Mixture of Experts with Applications to Multi-Task Learning



Hussein Hazimeh, Zhe Zhao, Aakanksha Chowdhery, Maheswaran Sathiamoorthy, Yihua Chen, Rahul Mazumder, Lichan Hong, Ed H. Chi



Visual Wake Words Dataset



Aakanksha Chowdhery, Pete Warden, Jonathon Shlens, Andrew Howard, Rocky Rhodes

* 10 pages, 4 figures 
