Alert button
Picture for Zoubin Ghahramani

Zoubin Ghahramani

Alert button

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Add code
Bookmark button
Alert button
Apr 11, 2024
Aleksandar Botev, Soham De, Samuel L Smith, Anushan Fernando, George-Cristian Muraru, Ruba Haroun, Leonard Berrada, Razvan Pascanu, Pier Giuseppe Sessa, Robert Dadashi, Léonard Hussenot, Johan Ferret, Sertan Girgin, Olivier Bachem, Alek Andreev, Kathleen Kenealy, Thomas Mesnard, Cassidy Hardin, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Armand Joulin, Noah Fiedel, Evan Senter, Yutian Chen, Srivatsan Srinivasan, Guillaume Desjardins, David Budden, Arnaud Doucet, Sharad Vikram, Adam Paszke, Trevor Gale, Sebastian Borgeaud, Charlie Chen, Andy Brock, Antonia Paterson, Jenny Brennan, Meg Risdal, Raj Gundluru, Nesh Devanathan, Paul Mooney, Nilay Chauhan, Phil Culliton, Luiz GUStavo Martins, Elisa Bandy, David Huntsperger, Glenn Cameron, Arthur Zucker, Tris Warkentin, Ludovic Peran, Minh Giang, Zoubin Ghahramani, Clément Farabet, Koray Kavukcuoglu, Demis Hassabis, Raia Hadsell, Yee Whye Teh, Nando de Frietas

Viaarxiv icon

Gemma: Open Models Based on Gemini Research and Technology

Add code
Bookmark button
Alert button
Mar 13, 2024
Gemma Team, Thomas Mesnard, Cassidy Hardin, Robert Dadashi, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Léonard Hussenot, Aakanksha Chowdhery, Adam Roberts, Aditya Barua, Alex Botev, Alex Castro-Ros, Ambrose Slone, Amélie Héliou, Andrea Tacchetti, Anna Bulanova, Antonia Paterson, Beth Tsai, Bobak Shahriari, Charline Le Lan, Christopher A. Choquette-Choo, Clément Crepy, Daniel Cer, Daphne Ippolito, David Reid, Elena Buchatskaya, Eric Ni, Eric Noland, Geng Yan, George Tucker, George-Christian Muraru, Grigory Rozhdestvenskiy, Henryk Michalewski, Ian Tenney, Ivan Grishchenko, Jacob Austin, James Keeling, Jane Labanowski, Jean-Baptiste Lespiau, Jeff Stanway, Jenny Brennan, Jeremy Chen, Johan Ferret, Justin Chiu, Justin Mao-Jones, Katherine Lee, Kathy Yu, Katie Millican, Lars Lowe Sjoesund, Lisa Lee, Lucas Dixon, Machel Reid, Maciej Mikuła, Mateo Wirth, Michael Sharman, Nikolai Chinaev, Nithum Thain, Olivier Bachem, Oscar Chang, Oscar Wahltinez, Paige Bailey, Paul Michel, Petko Yotov, Pier Giuseppe Sessa, Rahma Chaabouni, Ramona Comanescu, Reena Jana, Rohan Anil, Ross McIlroy, Ruibo Liu, Ryan Mullins, Samuel L Smith, Sebastian Borgeaud, Sertan Girgin, Sholto Douglas, Shree Pandya, Siamak Shakeri, Soham De, Ted Klimenko, Tom Hennigan, Vlad Feinberg, Wojciech Stokowiec, Yu-hui Chen, Zafarali Ahmed, Zhitao Gong, Tris Warkentin, Ludovic Peran, Minh Giang, Clément Farabet, Oriol Vinyals, Jeff Dean, Koray Kavukcuoglu, Demis Hassabis, Zoubin Ghahramani, Douglas Eck, Joelle Barral, Fernando Pereira, Eli Collins, Armand Joulin, Noah Fiedel, Evan Senter, Alek Andreev, Kathleen Kenealy

Figure 1 for Gemma: Open Models Based on Gemini Research and Technology
Figure 2 for Gemma: Open Models Based on Gemini Research and Technology
Figure 3 for Gemma: Open Models Based on Gemini Research and Technology
Figure 4 for Gemma: Open Models Based on Gemini Research and Technology
Viaarxiv icon

Plex: Towards Reliability using Pretrained Large Model Extensions

Add code
Bookmark button
Alert button
Jul 15, 2022
Dustin Tran, Jeremiah Liu, Michael W. Dusenberry, Du Phan, Mark Collier, Jie Ren, Kehang Han, Zi Wang, Zelda Mariet, Huiyi Hu, Neil Band, Tim G. J. Rudner, Karan Singhal, Zachary Nado, Joost van Amersfoort, Andreas Kirsch, Rodolphe Jenatton, Nithum Thain, Honglin Yuan, Kelly Buchanan, Kevin Murphy, D. Sculley, Yarin Gal, Zoubin Ghahramani, Jasper Snoek, Balaji Lakshminarayanan

Figure 1 for Plex: Towards Reliability using Pretrained Large Model Extensions
Figure 2 for Plex: Towards Reliability using Pretrained Large Model Extensions
Figure 3 for Plex: Towards Reliability using Pretrained Large Model Extensions
Figure 4 for Plex: Towards Reliability using Pretrained Large Model Extensions
Viaarxiv icon

Pre-training helps Bayesian optimization too

Add code
Bookmark button
Alert button
Jul 07, 2022
Zi Wang, George E. Dahl, Kevin Swersky, Chansoo Lee, Zelda Mariet, Zachary Nado, Justin Gilmer, Jasper Snoek, Zoubin Ghahramani

Figure 1 for Pre-training helps Bayesian optimization too
Figure 2 for Pre-training helps Bayesian optimization too
Figure 3 for Pre-training helps Bayesian optimization too
Figure 4 for Pre-training helps Bayesian optimization too
Viaarxiv icon

Neural Diffusion Processes

Add code
Bookmark button
Alert button
Jun 08, 2022
Vincent Dutordoir, Alan Saul, Zoubin Ghahramani, Fergus Simpson

Figure 1 for Neural Diffusion Processes
Figure 2 for Neural Diffusion Processes
Figure 3 for Neural Diffusion Processes
Figure 4 for Neural Diffusion Processes
Viaarxiv icon

Automatic prior selection for meta Bayesian optimization with a case study on tuning deep neural network optimizers

Add code
Bookmark button
Alert button
Sep 16, 2021
Zi Wang, George E. Dahl, Kevin Swersky, Chansoo Lee, Zelda Mariet, Zack Nado, Justin Gilmer, Jasper Snoek, Zoubin Ghahramani

Figure 1 for Automatic prior selection for meta Bayesian optimization with a case study on tuning deep neural network optimizers
Figure 2 for Automatic prior selection for meta Bayesian optimization with a case study on tuning deep neural network optimizers
Figure 3 for Automatic prior selection for meta Bayesian optimization with a case study on tuning deep neural network optimizers
Figure 4 for Automatic prior selection for meta Bayesian optimization with a case study on tuning deep neural network optimizers
Viaarxiv icon

Deep Neural Networks as Point Estimates for Deep Gaussian Processes

Add code
Bookmark button
Alert button
May 10, 2021
Vincent Dutordoir, James Hensman, Mark van der Wilk, Carl Henrik Ek, Zoubin Ghahramani, Nicolas Durrande

Figure 1 for Deep Neural Networks as Point Estimates for Deep Gaussian Processes
Figure 2 for Deep Neural Networks as Point Estimates for Deep Gaussian Processes
Figure 3 for Deep Neural Networks as Point Estimates for Deep Gaussian Processes
Figure 4 for Deep Neural Networks as Point Estimates for Deep Gaussian Processes
Viaarxiv icon

Learning Continuous Treatment Policy and Bipartite Embeddings for Matching with Heterogeneous Causal Effects

Add code
Bookmark button
Alert button
Apr 21, 2020
Will Y. Zou, Smitha Shyam, Michael Mui, Mingshi Wang, Jan Pedersen, Zoubin Ghahramani

Figure 1 for Learning Continuous Treatment Policy and Bipartite Embeddings for Matching with Heterogeneous Causal Effects
Figure 2 for Learning Continuous Treatment Policy and Bipartite Embeddings for Matching with Heterogeneous Causal Effects
Figure 3 for Learning Continuous Treatment Policy and Bipartite Embeddings for Matching with Heterogeneous Causal Effects
Figure 4 for Learning Continuous Treatment Policy and Bipartite Embeddings for Matching with Heterogeneous Causal Effects
Viaarxiv icon

Einsum Networks: Fast and Scalable Learning of Tractable Probabilistic Circuits

Add code
Bookmark button
Alert button
Apr 13, 2020
Robert Peharz, Steven Lang, Antonio Vergari, Karl Stelzner, Alejandro Molina, Martin Trapp, Guy Van den Broeck, Kristian Kersting, Zoubin Ghahramani

Figure 1 for Einsum Networks: Fast and Scalable Learning of Tractable Probabilistic Circuits
Figure 2 for Einsum Networks: Fast and Scalable Learning of Tractable Probabilistic Circuits
Figure 3 for Einsum Networks: Fast and Scalable Learning of Tractable Probabilistic Circuits
Figure 4 for Einsum Networks: Fast and Scalable Learning of Tractable Probabilistic Circuits
Viaarxiv icon

DynamicPPL: Stan-like Speed for Dynamic Probabilistic Models

Add code
Bookmark button
Alert button
Feb 07, 2020
Mohamed Tarek, Kai Xu, Martin Trapp, Hong Ge, Zoubin Ghahramani

Figure 1 for DynamicPPL: Stan-like Speed for Dynamic Probabilistic Models
Viaarxiv icon