Arthur Mensch

DMA, PARIETAL

Scaling Language Models: Methods, Analysis & Insights from Training Gopher

Dec 08, 2021
Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George van den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-Sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor, Irina Higgins, Antonia Creswell, Nat McAleese, Amy Wu, Erich Elsen, Siddhant Jayakumar, Elena Buchatskaya, David Budden, Esme Sutherland, Karen Simonyan, Michela Paganini, Laurent Sifre, Lena Martens, Xiang Lorraine Li, Adhiguna Kuncoro, Aida Nematzadeh, Elena Gribovskaya, Domenic Donato, Angeliki Lazaridou, Arthur Mensch, Jean-Baptiste Lespiau, Maria Tsimpoukelli, Nikolai Grigorev, Doug Fritz, Thibault Sottiaux, Mantas Pajarskas, Toby Pohlen, Zhitao Gong, Daniel Toyama, Cyprien de Masson d'Autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego de Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew Johnson, Blake Hechtman, Laura Weidinger, Iason Gabriel, William Isaac, Ed Lockhart, Simon Osindero, Laura Rimell, Chris Dyer, Oriol Vinyals, Kareem Ayoub, Jeff Stanway, Lorrayne Bennett, Demis Hassabis, Koray Kavukcuoglu, Geoffrey Irving

Via arXiv

Improving language models by retrieving from trillions of tokens

Dec 08, 2021
Sebastian Borgeaud, Arthur Mensch, Jordan Hoffmann, Trevor Cai, Eliza Rutherford, Katie Millican, George van den Driessche, Jean-Baptiste Lespiau, Bogdan Damoc, Aidan Clark, Diego de Las Casas, Aurelia Guy, Jacob Menick, Roman Ring, Tom Hennigan, Saffron Huang, Loren Maggiore, Chris Jones, Albin Cassirer, Andy Brock, Michela Paganini, Geoffrey Irving, Oriol Vinyals, Simon Osindero, Karen Simonyan, Jack W. Rae, Erich Elsen, Laurent Sifre

Via arXiv

Differentiable Divergences Between Time Series

Oct 16, 2020
Mathieu Blondel, Arthur Mensch, Jean-Philippe Vert

Via arXiv

Fine-grain atlases of functional modes for fMRI analysis

Mar 05, 2020
Kamalaker Dadi, Gaël Varoquaux, Antonia Machlouzarides-Shalit, Krzysztof J. Gorgolewski, Demian Wassermann, Bertrand Thirion, Arthur Mensch

Via arXiv

Online Sinkhorn: optimal transportation distances from sample streams

Mar 03, 2020
Arthur Mensch, Gabriel Peyré

Via arXiv

A mean-field analysis of two-player zero-sum games

Feb 24, 2020
Carles Domingo-Enrich, Samy Jelassi, Arthur Mensch, Grant Rotskoff, Joan Bruna

Via arXiv

Extra-gradient with player sampling for provable fast convergence in n-player games

Jun 04, 2019
Carles Domingo-Enrich, Samy Jelassi, Damien Scieur, Arthur Mensch, Joan Bruna

Via arXiv

Geometric Losses for Distributional Learning

May 15, 2019
Arthur Mensch, Mathieu Blondel, Gabriel Peyré

Via arXiv