Alert button
Picture for Jeff Dean

Jeff Dean

Alert button

Gemma: Open Models Based on Gemini Research and Technology

Add code
Bookmark button
Alert button
Mar 13, 2024
Gemma Team, Thomas Mesnard, Cassidy Hardin, Robert Dadashi, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Léonard Hussenot, Aakanksha Chowdhery, Adam Roberts, Aditya Barua, Alex Botev, Alex Castro-Ros, Ambrose Slone, Amélie Héliou, Andrea Tacchetti, Anna Bulanova, Antonia Paterson, Beth Tsai, Bobak Shahriari, Charline Le Lan, Christopher A. Choquette-Choo, Clément Crepy, Daniel Cer, Daphne Ippolito, David Reid, Elena Buchatskaya, Eric Ni, Eric Noland, Geng Yan, George Tucker, George-Christian Muraru, Grigory Rozhdestvenskiy, Henryk Michalewski, Ian Tenney, Ivan Grishchenko, Jacob Austin, James Keeling, Jane Labanowski, Jean-Baptiste Lespiau, Jeff Stanway, Jenny Brennan, Jeremy Chen, Johan Ferret, Justin Chiu, Justin Mao-Jones, Katherine Lee, Kathy Yu, Katie Millican, Lars Lowe Sjoesund, Lisa Lee, Lucas Dixon, Machel Reid, Maciej Mikuła, Mateo Wirth, Michael Sharman, Nikolai Chinaev, Nithum Thain, Olivier Bachem, Oscar Chang, Oscar Wahltinez, Paige Bailey, Paul Michel, Petko Yotov, Pier Giuseppe Sessa, Rahma Chaabouni, Ramona Comanescu, Reena Jana, Rohan Anil, Ross McIlroy, Ruibo Liu, Ryan Mullins, Samuel L Smith, Sebastian Borgeaud, Sertan Girgin, Sholto Douglas, Shree Pandya, Siamak Shakeri, Soham De, Ted Klimenko, Tom Hennigan, Vlad Feinberg, Wojciech Stokowiec, Yu-hui Chen, Zafarali Ahmed, Zhitao Gong, Tris Warkentin, Ludovic Peran, Minh Giang, Clément Farabet, Oriol Vinyals, Jeff Dean, Koray Kavukcuoglu, Demis Hassabis, Zoubin Ghahramani, Douglas Eck, Joelle Barral, Fernando Pereira, Eli Collins, Armand Joulin, Noah Fiedel, Evan Senter, Alek Andreev, Kathleen Kenealy

Figure 1 for Gemma: Open Models Based on Gemini Research and Technology
Figure 2 for Gemma: Open Models Based on Gemini Research and Technology
Figure 3 for Gemma: Open Models Based on Gemini Research and Technology
Figure 4 for Gemma: Open Models Based on Gemini Research and Technology
Viaarxiv icon

Brainformers: Trading Simplicity for Efficiency

Add code
Bookmark button
Alert button
May 29, 2023
Yanqi Zhou, Nan Du, Yanping Huang, Daiyi Peng, Chang Lan, Da Huang, Siamak Shakeri, David So, Andrew Dai, Yifeng Lu, Zhifeng Chen, Quoc Le, Claire Cui, James Laundon, Jeff Dean

Figure 1 for Brainformers: Trading Simplicity for Efficiency
Figure 2 for Brainformers: Trading Simplicity for Efficiency
Figure 3 for Brainformers: Trading Simplicity for Efficiency
Figure 4 for Brainformers: Trading Simplicity for Efficiency
Viaarxiv icon

Efficiently Scaling Transformer Inference

Add code
Bookmark button
Alert button
Nov 09, 2022
Reiner Pope, Sholto Douglas, Aakanksha Chowdhery, Jacob Devlin, James Bradbury, Anselm Levskaya, Jonathan Heek, Kefan Xiao, Shivani Agrawal, Jeff Dean

Figure 1 for Efficiently Scaling Transformer Inference
Figure 2 for Efficiently Scaling Transformer Inference
Figure 3 for Efficiently Scaling Transformer Inference
Figure 4 for Efficiently Scaling Transformer Inference
Viaarxiv icon

Scaling Instruction-Finetuned Language Models

Add code
Bookmark button
Alert button
Oct 20, 2022
Hyung Won Chung, Le Hou, Shayne Longpre, Barret Zoph, Yi Tay, William Fedus, Eric Li, Xuezhi Wang, Mostafa Dehghani, Siddhartha Brahma, Albert Webson, Shixiang Shane Gu, Zhuyun Dai, Mirac Suzgun, Xinyun Chen, Aakanksha Chowdhery, Sharan Narang, Gaurav Mishra, Adams Yu, Vincent Zhao, Yanping Huang, Andrew Dai, Hongkun Yu, Slav Petrov, Ed H. Chi, Jeff Dean, Jacob Devlin, Adam Roberts, Denny Zhou, Quoc V. Le, Jason Wei

Figure 1 for Scaling Instruction-Finetuned Language Models
Figure 2 for Scaling Instruction-Finetuned Language Models
Figure 3 for Scaling Instruction-Finetuned Language Models
Figure 4 for Scaling Instruction-Finetuned Language Models
Viaarxiv icon

A Review of Sparse Expert Models in Deep Learning

Add code
Bookmark button
Alert button
Sep 04, 2022
William Fedus, Jeff Dean, Barret Zoph

Figure 1 for A Review of Sparse Expert Models in Deep Learning
Figure 2 for A Review of Sparse Expert Models in Deep Learning
Figure 3 for A Review of Sparse Expert Models in Deep Learning
Figure 4 for A Review of Sparse Expert Models in Deep Learning
Viaarxiv icon

Emergent Abilities of Large Language Models

Add code
Bookmark button
Alert button
Jun 15, 2022
Jason Wei, Yi Tay, Rishi Bommasani, Colin Raffel, Barret Zoph, Sebastian Borgeaud, Dani Yogatama, Maarten Bosma, Denny Zhou, Donald Metzler, Ed H. Chi, Tatsunori Hashimoto, Oriol Vinyals, Percy Liang, Jeff Dean, William Fedus

Figure 1 for Emergent Abilities of Large Language Models
Figure 2 for Emergent Abilities of Large Language Models
Figure 3 for Emergent Abilities of Large Language Models
Figure 4 for Emergent Abilities of Large Language Models
Viaarxiv icon

An Evolutionary Approach to Dynamic Introduction of Tasks in Large-scale Multitask Learning Systems

Add code
Bookmark button
Alert button
Jun 05, 2022
Andrea Gesmundo, Jeff Dean

Figure 1 for An Evolutionary Approach to Dynamic Introduction of Tasks in Large-scale Multitask Learning Systems
Figure 2 for An Evolutionary Approach to Dynamic Introduction of Tasks in Large-scale Multitask Learning Systems
Figure 3 for An Evolutionary Approach to Dynamic Introduction of Tasks in Large-scale Multitask Learning Systems
Figure 4 for An Evolutionary Approach to Dynamic Introduction of Tasks in Large-scale Multitask Learning Systems
Viaarxiv icon

muNet: Evolving Pretrained Deep Neural Networks into Scalable Auto-tuning Multitask Systems

Add code
Bookmark button
Alert button
May 25, 2022
Andrea Gesmundo, Jeff Dean

Figure 1 for muNet: Evolving Pretrained Deep Neural Networks into Scalable Auto-tuning Multitask Systems
Figure 2 for muNet: Evolving Pretrained Deep Neural Networks into Scalable Auto-tuning Multitask Systems
Figure 3 for muNet: Evolving Pretrained Deep Neural Networks into Scalable Auto-tuning Multitask Systems
Figure 4 for muNet: Evolving Pretrained Deep Neural Networks into Scalable Auto-tuning Multitask Systems
Viaarxiv icon