Alert button
Picture for Jared Davis

Jared Davis

Alert button

Debiasing a First-order Heuristic for Approximate Bi-level Optimization

Add code
Bookmark button
Alert button
Jun 08, 2021
Valerii Likhosherstov, Xingyou Song, Krzysztof Choromanski, Jared Davis, Adrian Weller

Figure 1 for Debiasing a First-order Heuristic for Approximate Bi-level Optimization
Figure 2 for Debiasing a First-order Heuristic for Approximate Bi-level Optimization
Figure 3 for Debiasing a First-order Heuristic for Approximate Bi-level Optimization
Figure 4 for Debiasing a First-order Heuristic for Approximate Bi-level Optimization
Viaarxiv icon

Sub-Linear Memory: How to Make Performers SLiM

Add code
Bookmark button
Alert button
Dec 21, 2020
Valerii Likhosherstov, Krzysztof Choromanski, Jared Davis, Xingyou Song, Adrian Weller

Figure 1 for Sub-Linear Memory: How to Make Performers SLiM
Figure 2 for Sub-Linear Memory: How to Make Performers SLiM
Figure 3 for Sub-Linear Memory: How to Make Performers SLiM
Figure 4 for Sub-Linear Memory: How to Make Performers SLiM
Viaarxiv icon

Rethinking Attention with Performers

Add code
Bookmark button
Alert button
Sep 30, 2020
Krzysztof Choromanski, Valerii Likhosherstov, David Dohan, Xingyou Song, Andreea Gane, Tamas Sarlos, Peter Hawkins, Jared Davis, Afroz Mohiuddin, Lukasz Kaiser, David Belanger, Lucy Colwell, Adrian Weller

Figure 1 for Rethinking Attention with Performers
Figure 2 for Rethinking Attention with Performers
Figure 3 for Rethinking Attention with Performers
Figure 4 for Rethinking Attention with Performers
Viaarxiv icon

UFO-BLO: Unbiased First-Order Bilevel Optimization

Add code
Bookmark button
Alert button
Jun 05, 2020
Valerii Likhosherstov, Xingyou Song, Krzysztof Choromanski, Jared Davis, Adrian Weller

Figure 1 for UFO-BLO: Unbiased First-Order Bilevel Optimization
Figure 2 for UFO-BLO: Unbiased First-Order Bilevel Optimization
Figure 3 for UFO-BLO: Unbiased First-Order Bilevel Optimization
Figure 4 for UFO-BLO: Unbiased First-Order Bilevel Optimization
Viaarxiv icon

Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers

Add code
Bookmark button
Alert button
Jun 05, 2020
Krzysztof Choromanski, Valerii Likhosherstov, David Dohan, Xingyou Song, Jared Davis, Tamas Sarlos, David Belanger, Lucy Colwell, Adrian Weller

Figure 1 for Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers
Figure 2 for Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers
Figure 3 for Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers
Figure 4 for Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers
Viaarxiv icon

CWY Parametrization for Scalable Learning of Orthogonal and Stiefel Matrices

Add code
Bookmark button
Alert button
Apr 18, 2020
Valerii Likhosherstov, Jared Davis, Krzysztof Choromanski, Adrian Weller

Figure 1 for CWY Parametrization for Scalable Learning of Orthogonal and Stiefel Matrices
Figure 2 for CWY Parametrization for Scalable Learning of Orthogonal and Stiefel Matrices
Figure 3 for CWY Parametrization for Scalable Learning of Orthogonal and Stiefel Matrices
Figure 4 for CWY Parametrization for Scalable Learning of Orthogonal and Stiefel Matrices
Viaarxiv icon

Stochastic Flows and Geometric Optimization on the Orthogonal Group

Add code
Bookmark button
Alert button
Mar 30, 2020
Krzysztof Choromanski, David Cheikhi, Jared Davis, Valerii Likhosherstov, Achille Nazaret, Achraf Bahamou, Xingyou Song, Mrugank Akarte, Jack Parker-Holder, Jacob Bergquist, Yuan Gao, Aldo Pacchiano, Tamas Sarlos, Adrian Weller, Vikas Sindhwani

Figure 1 for Stochastic Flows and Geometric Optimization on the Orthogonal Group
Figure 2 for Stochastic Flows and Geometric Optimization on the Orthogonal Group
Figure 3 for Stochastic Flows and Geometric Optimization on the Orthogonal Group
Figure 4 for Stochastic Flows and Geometric Optimization on the Orthogonal Group
Viaarxiv icon