Alert button
Picture for Noam Shazeer

Noam Shazeer

Alert button

Attention Is All You Need

Add code
Bookmark button
Alert button
Dec 06, 2017
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin

Figure 1 for Attention Is All You Need
Figure 2 for Attention Is All You Need
Figure 3 for Attention Is All You Need
Figure 4 for Attention Is All You Need
Viaarxiv icon

One Model To Learn Them All

Add code
Bookmark button
Alert button
Jun 16, 2017
Lukasz Kaiser, Aidan N. Gomez, Noam Shazeer, Ashish Vaswani, Niki Parmar, Llion Jones, Jakob Uszkoreit

Figure 1 for One Model To Learn Them All
Figure 2 for One Model To Learn Them All
Figure 3 for One Model To Learn Them All
Figure 4 for One Model To Learn Them All
Viaarxiv icon

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer

Add code
Bookmark button
Alert button
Jan 23, 2017
Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean

Figure 1 for Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
Figure 2 for Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
Figure 3 for Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
Figure 4 for Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
Viaarxiv icon

NN-grams: Unifying neural network and n-gram language models for Speech Recognition

Add code
Bookmark button
Alert button
Jun 23, 2016
Babak Damavandi, Shankar Kumar, Noam Shazeer, Antoine Bruguier

Figure 1 for NN-grams: Unifying neural network and n-gram language models for Speech Recognition
Figure 2 for NN-grams: Unifying neural network and n-gram language models for Speech Recognition
Figure 3 for NN-grams: Unifying neural network and n-gram language models for Speech Recognition
Figure 4 for NN-grams: Unifying neural network and n-gram language models for Speech Recognition
Viaarxiv icon

Exploring the Limits of Language Modeling

Add code
Bookmark button
Alert button
Feb 11, 2016
Rafal Jozefowicz, Oriol Vinyals, Mike Schuster, Noam Shazeer, Yonghui Wu

Figure 1 for Exploring the Limits of Language Modeling
Figure 2 for Exploring the Limits of Language Modeling
Figure 3 for Exploring the Limits of Language Modeling
Figure 4 for Exploring the Limits of Language Modeling
Viaarxiv icon

Swivel: Improving Embeddings by Noticing What's Missing

Add code
Bookmark button
Alert button
Feb 06, 2016
Noam Shazeer, Ryan Doherty, Colin Evans, Chris Waterson

Figure 1 for Swivel: Improving Embeddings by Noticing What's Missing
Figure 2 for Swivel: Improving Embeddings by Noticing What's Missing
Figure 3 for Swivel: Improving Embeddings by Noticing What's Missing
Figure 4 for Swivel: Improving Embeddings by Noticing What's Missing
Viaarxiv icon

End-to-End Text-Dependent Speaker Verification

Add code
Bookmark button
Alert button
Sep 27, 2015
Georg Heigold, Ignacio Moreno, Samy Bengio, Noam Shazeer

Figure 1 for End-to-End Text-Dependent Speaker Verification
Figure 2 for End-to-End Text-Dependent Speaker Verification
Figure 3 for End-to-End Text-Dependent Speaker Verification
Figure 4 for End-to-End Text-Dependent Speaker Verification
Viaarxiv icon

Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks

Add code
Bookmark button
Alert button
Sep 23, 2015
Samy Bengio, Oriol Vinyals, Navdeep Jaitly, Noam Shazeer

Figure 1 for Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks
Figure 2 for Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks
Figure 3 for Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks
Figure 4 for Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks
Viaarxiv icon

Skip-gram Language Modeling Using Sparse Non-negative Matrix Probability Estimation

Add code
Bookmark button
Alert button
Jun 26, 2015
Noam Shazeer, Joris Pelemans, Ciprian Chelba

Figure 1 for Skip-gram Language Modeling Using Sparse Non-negative Matrix Probability Estimation
Figure 2 for Skip-gram Language Modeling Using Sparse Non-negative Matrix Probability Estimation
Figure 3 for Skip-gram Language Modeling Using Sparse Non-negative Matrix Probability Estimation
Figure 4 for Skip-gram Language Modeling Using Sparse Non-negative Matrix Probability Estimation
Viaarxiv icon

Variational Program Inference

Add code
Bookmark button
Alert button
Jun 04, 2010
Georges Harik, Noam Shazeer

Viaarxiv icon