Alert button
Picture for Michael Matena

Michael Matena

Alert button

NPEFF: Non-Negative Per-Example Fisher Factorization

Add code
Bookmark button
Alert button
Oct 07, 2023
Michael Matena, Colin Raffel

Figure 1 for NPEFF: Non-Negative Per-Example Fisher Factorization
Figure 2 for NPEFF: Non-Negative Per-Example Fisher Factorization
Figure 3 for NPEFF: Non-Negative Per-Example Fisher Factorization
Figure 4 for NPEFF: Non-Negative Per-Example Fisher Factorization
Viaarxiv icon

A Combinatorial Perspective on the Optimization of Shallow ReLU Networks

Add code
Bookmark button
Alert button
Oct 01, 2022
Michael Matena, Colin Raffel

Figure 1 for A Combinatorial Perspective on the Optimization of Shallow ReLU Networks
Figure 2 for A Combinatorial Perspective on the Optimization of Shallow ReLU Networks
Figure 3 for A Combinatorial Perspective on the Optimization of Shallow ReLU Networks
Figure 4 for A Combinatorial Perspective on the Optimization of Shallow ReLU Networks
Viaarxiv icon

Merging Models with Fisher-Weighted Averaging

Add code
Bookmark button
Alert button
Nov 18, 2021
Michael Matena, Colin Raffel

Figure 1 for Merging Models with Fisher-Weighted Averaging
Figure 2 for Merging Models with Fisher-Weighted Averaging
Figure 3 for Merging Models with Fisher-Weighted Averaging
Figure 4 for Merging Models with Fisher-Weighted Averaging
Viaarxiv icon

Do Transformer Modifications Transfer Across Implementations and Applications?

Add code
Bookmark button
Alert button
Feb 23, 2021
Sharan Narang, Hyung Won Chung, Yi Tay, William Fedus, Thibault Fevry, Michael Matena, Karishma Malkan, Noah Fiedel, Noam Shazeer, Zhenzhong Lan, Yanqi Zhou, Wei Li, Nan Ding, Jake Marcus, Adam Roberts, Colin Raffel

Figure 1 for Do Transformer Modifications Transfer Across Implementations and Applications?
Figure 2 for Do Transformer Modifications Transfer Across Implementations and Applications?
Figure 3 for Do Transformer Modifications Transfer Across Implementations and Applications?
Viaarxiv icon

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Add code
Bookmark button
Alert button
Oct 24, 2019
Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, Peter J. Liu

Figure 1 for Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Figure 2 for Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Figure 3 for Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Figure 4 for Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Viaarxiv icon