Alert button
Picture for Max Lübbering

Max Lübbering

Alert button

Tokenizer Choice For LLM Training: Negligible or Crucial?

Add code
Bookmark button
Alert button
Oct 18, 2023
Mehdi Ali, Michael Fromm, Klaudia Thellmann, Richard Rutmann, Max Lübbering, Johannes Leveling, Katrin Klug, Jan Ebert, Niclas Doll, Jasper Schulze Buschhoff, Charvi Jain, Alexander Arno Weber, Lena Jurkschat, Hammam Abdelwahab, Chelsea John, Pedro Ortiz Suarez, Malte Ostendorff, Samuel Weinbach, Rafet Sifa, Stefan Kesselheim, Nicolas Flores-Herr

Figure 1 for Tokenizer Choice For LLM Training: Negligible or Crucial?
Figure 2 for Tokenizer Choice For LLM Training: Negligible or Crucial?
Figure 3 for Tokenizer Choice For LLM Training: Negligible or Crucial?
Figure 4 for Tokenizer Choice For LLM Training: Negligible or Crucial?
Viaarxiv icon

Towards Supervised Extractive Text Summarization via RNN-based Sequence Classification

Add code
Bookmark button
Alert button
Nov 13, 2019
Eduardo Brito, Max Lübbering, David Biesner, Lars Patrick Hillebrand, Christian Bauckhage

Figure 1 for Towards Supervised Extractive Text Summarization via RNN-based Sequence Classification
Figure 2 for Towards Supervised Extractive Text Summarization via RNN-based Sequence Classification
Viaarxiv icon