Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP



Sabrina J. Mielke , Zaid Alyafeai , Elizabeth Salesky , Colin Raffel , Manan Dey , Matthias Gallé , Arun Raja , Chenglei Si , Wilson Y. Lee , Benoît Sagot , Samson Tan

* 15 page preprint 

   Access Paper or Ask Questions