Alert button
Picture for Trevor Gale

Trevor Gale

Alert button

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Add code
Bookmark button
Alert button
Apr 11, 2024
Aleksandar Botev, Soham De, Samuel L Smith, Anushan Fernando, George-Cristian Muraru, Ruba Haroun, Leonard Berrada, Razvan Pascanu, Pier Giuseppe Sessa, Robert Dadashi, Léonard Hussenot, Johan Ferret, Sertan Girgin, Olivier Bachem, Alek Andreev, Kathleen Kenealy, Thomas Mesnard, Cassidy Hardin, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Armand Joulin, Noah Fiedel, Evan Senter, Yutian Chen, Srivatsan Srinivasan, Guillaume Desjardins, David Budden, Arnaud Doucet, Sharad Vikram, Adam Paszke, Trevor Gale, Sebastian Borgeaud, Charlie Chen, Andy Brock, Antonia Paterson, Jenny Brennan, Meg Risdal, Raj Gundluru, Nesh Devanathan, Paul Mooney, Nilay Chauhan, Phil Culliton, Luiz GUStavo Martins, Elisa Bandy, David Huntsperger, Glenn Cameron, Arthur Zucker, Tris Warkentin, Ludovic Peran, Minh Giang, Zoubin Ghahramani, Clément Farabet, Koray Kavukcuoglu, Demis Hassabis, Raia Hadsell, Yee Whye Teh, Nando de Frietas

Viaarxiv icon

JaxPruner: A concise library for sparsity research

Add code
Bookmark button
Alert button
May 02, 2023
Joo Hyung Lee, Wonpyo Park, Nicole Mitchell, Jonathan Pilault, Johan Obando-Ceron, Han-Byul Kim, Namhoon Lee, Elias Frantar, Yun Long, Amir Yazdanbakhsh, Shivani Agrawal, Suvinay Subramanian, Xin Wang, Sheng-Chun Kao, Xingyao Zhang, Trevor Gale, Aart Bik, Woohyun Han, Milen Ferev, Zhonglin Han, Hong-Seok Kim, Yann Dauphin, Gintare Karolina Dziugaite, Pablo Samuel Castro, Utku Evci

Figure 1 for JaxPruner: A concise library for sparsity research
Figure 2 for JaxPruner: A concise library for sparsity research
Viaarxiv icon

MegaBlocks: Efficient Sparse Training with Mixture-of-Experts

Add code
Bookmark button
Alert button
Nov 29, 2022
Trevor Gale, Deepak Narayanan, Cliff Young, Matei Zaharia

Figure 1 for MegaBlocks: Efficient Sparse Training with Mixture-of-Experts
Figure 2 for MegaBlocks: Efficient Sparse Training with Mixture-of-Experts
Figure 3 for MegaBlocks: Efficient Sparse Training with Mixture-of-Experts
Figure 4 for MegaBlocks: Efficient Sparse Training with Mixture-of-Experts
Viaarxiv icon

On the Opportunities and Risks of Foundation Models

Add code
Bookmark button
Alert button
Aug 18, 2021
Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel, Jared Quincy Davis, Dora Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh, Li Fei-Fei, Chelsea Finn, Trevor Gale, Lauren Gillespie, Karan Goel, Noah Goodman, Shelby Grossman, Neel Guha, Tatsunori Hashimoto, Peter Henderson, John Hewitt, Daniel E. Ho, Jenny Hong, Kyle Hsu, Jing Huang, Thomas Icard, Saahil Jain, Dan Jurafsky, Pratyusha Kalluri, Siddharth Karamcheti, Geoff Keeling, Fereshte Khani, Omar Khattab, Pang Wei Kohd, Mark Krass, Ranjay Krishna, Rohith Kuditipudi, Ananya Kumar, Faisal Ladhak, Mina Lee, Tony Lee, Jure Leskovec, Isabelle Levent, Xiang Lisa Li, Xuechen Li, Tengyu Ma, Ali Malik, Christopher D. Manning, Suvir Mirchandani, Eric Mitchell, Zanele Munyikwa, Suraj Nair, Avanika Narayan, Deepak Narayanan, Ben Newman, Allen Nie, Juan Carlos Niebles, Hamed Nilforoshan, Julian Nyarko, Giray Ogut, Laurel Orr, Isabel Papadimitriou, Joon Sung Park, Chris Piech, Eva Portelance, Christopher Potts, Aditi Raghunathan, Rob Reich, Hongyu Ren, Frieda Rong, Yusuf Roohani, Camilo Ruiz, Jack Ryan, Christopher Ré, Dorsa Sadigh, Shiori Sagawa, Keshav Santhanam, Andy Shih, Krishnan Srinivasan, Alex Tamkin, Rohan Taori, Armin W. Thomas, Florian Tramèr, Rose E. Wang, William Wang, Bohan Wu, Jiajun Wu, Yuhuai Wu, Sang Michael Xie, Michihiro Yasunaga, Jiaxuan You, Matei Zaharia, Michael Zhang, Tianyi Zhang, Xikun Zhang, Yuhui Zhang, Lucia Zheng, Kaitlyn Zhou, Percy Liang

Figure 1 for On the Opportunities and Risks of Foundation Models
Figure 2 for On the Opportunities and Risks of Foundation Models
Figure 3 for On the Opportunities and Risks of Foundation Models
Figure 4 for On the Opportunities and Risks of Foundation Models
Viaarxiv icon

Sparse GPU Kernels for Deep Learning

Add code
Bookmark button
Alert button
Jun 18, 2020
Trevor Gale, Matei Zaharia, Cliff Young, Erich Elsen

Figure 1 for Sparse GPU Kernels for Deep Learning
Figure 2 for Sparse GPU Kernels for Deep Learning
Figure 3 for Sparse GPU Kernels for Deep Learning
Figure 4 for Sparse GPU Kernels for Deep Learning
Viaarxiv icon

Rigging the Lottery: Making All Tickets Winners

Add code
Bookmark button
Alert button
Nov 25, 2019
Utku Evci, Trevor Gale, Jacob Menick, Pablo Samuel Castro, Erich Elsen

Figure 1 for Rigging the Lottery: Making All Tickets Winners
Figure 2 for Rigging the Lottery: Making All Tickets Winners
Figure 3 for Rigging the Lottery: Making All Tickets Winners
Figure 4 for Rigging the Lottery: Making All Tickets Winners
Viaarxiv icon

Fast Sparse ConvNets

Add code
Bookmark button
Alert button
Nov 21, 2019
Erich Elsen, Marat Dukhan, Trevor Gale, Karen Simonyan

Figure 1 for Fast Sparse ConvNets
Figure 2 for Fast Sparse ConvNets
Figure 3 for Fast Sparse ConvNets
Figure 4 for Fast Sparse ConvNets
Viaarxiv icon

The State of Sparsity in Deep Neural Networks

Add code
Bookmark button
Alert button
Feb 25, 2019
Trevor Gale, Erich Elsen, Sara Hooker

Figure 1 for The State of Sparsity in Deep Neural Networks
Figure 2 for The State of Sparsity in Deep Neural Networks
Figure 3 for The State of Sparsity in Deep Neural Networks
Figure 4 for The State of Sparsity in Deep Neural Networks
Viaarxiv icon