Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
Single Headed Attention RNN: Stop Thinking With Your Head

Nov 27, 2019
Stephen Merity

* Addition of citations and contextual results (no attention head, single attention head, attention per layer), removal of wordpiece WikiText-103 numbers due to normalization issues, fix of SHA attention figure Q arrow, other minor fixes 

  Access Paper or Ask Questions

An Analysis of Neural Language Modeling at Multiple Scales

Mar 22, 2018
Stephen Merity, Nitish Shirish Keskar, Richard Socher


  Access Paper or Ask Questions

A Flexible Approach to Automated RNN Architecture Generation

Dec 20, 2017
Martin Schrimpf, Stephen Merity, James Bradbury, Richard Socher


  Access Paper or Ask Questions

Regularizing and Optimizing LSTM Language Models

Aug 07, 2017
Stephen Merity, Nitish Shirish Keskar, Richard Socher


  Access Paper or Ask Questions

Revisiting Activation Regularization for Language RNNs

Aug 03, 2017
Stephen Merity, Bryan McCann, Richard Socher


  Access Paper or Ask Questions

Quasi-Recurrent Neural Networks

Nov 21, 2016
James Bradbury, Stephen Merity, Caiming Xiong, Richard Socher

* Submitted to conference track at ICLR 2017 

  Access Paper or Ask Questions

Pointer Sentinel Mixture Models

Sep 26, 2016
Stephen Merity, Caiming Xiong, James Bradbury, Richard Socher


  Access Paper or Ask Questions

Dynamic Memory Networks for Visual and Textual Question Answering

Mar 04, 2016
Caiming Xiong, Stephen Merity, Richard Socher


  Access Paper or Ask Questions