Alert button
Picture for Łukasz Kaiser

Łukasz Kaiser

Alert button

tsGT: Stochastic Time Series Modeling With Transformer

Add code
Bookmark button
Alert button
Mar 15, 2024
Łukasz Kuciński, Witold Drzewakowski, Mateusz Olko, Piotr Kozakowski, Łukasz Maziarka, Marta Emilia Nowakowska, Łukasz Kaiser, Piotr Miłoś

Figure 1 for tsGT: Stochastic Time Series Modeling With Transformer
Figure 2 for tsGT: Stochastic Time Series Modeling With Transformer
Figure 3 for tsGT: Stochastic Time Series Modeling With Transformer
Figure 4 for tsGT: Stochastic Time Series Modeling With Transformer
Viaarxiv icon

Sparse is Enough in Scaling Transformers

Add code
Bookmark button
Alert button
Nov 24, 2021
Sebastian Jaszczur, Aakanksha Chowdhery, Afroz Mohiuddin, Łukasz Kaiser, Wojciech Gajewski, Henryk Michalewski, Jonni Kanerva

Figure 1 for Sparse is Enough in Scaling Transformers
Figure 2 for Sparse is Enough in Scaling Transformers
Figure 3 for Sparse is Enough in Scaling Transformers
Figure 4 for Sparse is Enough in Scaling Transformers
Viaarxiv icon

Hierarchical Transformers Are More Efficient Language Models

Add code
Bookmark button
Alert button
Oct 26, 2021
Piotr Nawrot, Szymon Tworkowski, Michał Tyrolski, Łukasz Kaiser, Yuhuai Wu, Christian Szegedy, Henryk Michalewski

Figure 1 for Hierarchical Transformers Are More Efficient Language Models
Figure 2 for Hierarchical Transformers Are More Efficient Language Models
Figure 3 for Hierarchical Transformers Are More Efficient Language Models
Figure 4 for Hierarchical Transformers Are More Efficient Language Models
Viaarxiv icon

Q-Value Weighted Regression: Reinforcement Learning with Limited Data

Add code
Bookmark button
Alert button
Feb 12, 2021
Piotr Kozakowski, Łukasz Kaiser, Henryk Michalewski, Afroz Mohiuddin, Katarzyna Kańska

Figure 1 for Q-Value Weighted Regression: Reinforcement Learning with Limited Data
Figure 2 for Q-Value Weighted Regression: Reinforcement Learning with Limited Data
Figure 3 for Q-Value Weighted Regression: Reinforcement Learning with Limited Data
Figure 4 for Q-Value Weighted Regression: Reinforcement Learning with Limited Data
Viaarxiv icon

Reformer: The Efficient Transformer

Add code
Bookmark button
Alert button
Feb 18, 2020
Nikita Kitaev, Łukasz Kaiser, Anselm Levskaya

Figure 1 for Reformer: The Efficient Transformer
Figure 2 for Reformer: The Efficient Transformer
Figure 3 for Reformer: The Efficient Transformer
Figure 4 for Reformer: The Efficient Transformer
Viaarxiv icon

Universal Transformers

Add code
Bookmark button
Alert button
Jul 10, 2018
Mostafa Dehghani, Stephan Gouws, Oriol Vinyals, Jakob Uszkoreit, Łukasz Kaiser

Figure 1 for Universal Transformers
Figure 2 for Universal Transformers
Figure 3 for Universal Transformers
Figure 4 for Universal Transformers
Viaarxiv icon

Image Transformer

Add code
Bookmark button
Alert button
Jun 15, 2018
Niki Parmar, Ashish Vaswani, Jakob Uszkoreit, Łukasz Kaiser, Noam Shazeer, Alexander Ku, Dustin Tran

Figure 1 for Image Transformer
Figure 2 for Image Transformer
Figure 3 for Image Transformer
Figure 4 for Image Transformer
Viaarxiv icon

Fast Decoding in Sequence Models using Discrete Latent Variables

Add code
Bookmark button
Alert button
Jun 07, 2018
Łukasz Kaiser, Aurko Roy, Ashish Vaswani, Niki Parmar, Samy Bengio, Jakob Uszkoreit, Noam Shazeer

Figure 1 for Fast Decoding in Sequence Models using Discrete Latent Variables
Figure 2 for Fast Decoding in Sequence Models using Discrete Latent Variables
Figure 3 for Fast Decoding in Sequence Models using Discrete Latent Variables
Figure 4 for Fast Decoding in Sequence Models using Discrete Latent Variables
Viaarxiv icon

Tensor2Tensor for Neural Machine Translation

Add code
Bookmark button
Alert button
Mar 16, 2018
Ashish Vaswani, Samy Bengio, Eugene Brevdo, Francois Chollet, Aidan N. Gomez, Stephan Gouws, Llion Jones, Łukasz Kaiser, Nal Kalchbrenner, Niki Parmar, Ryan Sepassi, Noam Shazeer, Jakob Uszkoreit

Figure 1 for Tensor2Tensor for Neural Machine Translation
Viaarxiv icon