Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nick Jelicic

SHARE: Social-Humanities AI for Research and Education

Apr 13, 2026

João Gonçalves, Sonia de Jager, Petr Knoth, David Pride, Nick Jelicic

Abstract:This intermediate technical report introduces the SHARE family of base models and the MIRROR user interface. The SHARE models are the first causal language models fully pretrained by and for the social sciences and humanities (SSH). Their performance in modelling SSH texts is close to that of general purpose models (Phi-4) which use 100 times more tokens, as shown by our custom SSH Cloze benchmark. The MIRROR user interface is designed for reviewing text inputs from the SSH disciplines while preserving critical engagement. By prototyping a generative AI interface that does not generate any text, we propose a way to harness the capabilities of the SHARE models without compromising the integrity of SSH principles and norms.

* 23 pages, 9 figures, 4 tables

Via

Access Paper or Ask Questions

The advantages of context specific language models: the case of the Erasmian Language Model

Aug 13, 2024

João Gonçalves, Nick Jelicic, Michele Murgia, Evert Stamhuis

Figure 1 for The advantages of context specific language models: the case of the Erasmian Language Model

Figure 2 for The advantages of context specific language models: the case of the Erasmian Language Model

Figure 3 for The advantages of context specific language models: the case of the Erasmian Language Model

Figure 4 for The advantages of context specific language models: the case of the Erasmian Language Model

Abstract:The current trend to improve language model performance seems to be based on scaling up with the number of parameters (e.g. the state of the art GPT4 model has approximately 1.7 trillion parameters) or the amount of training data fed into the model. However this comes at significant costs in terms of computational resources and energy costs that compromise the sustainability of AI solutions, as well as risk relating to privacy and misuse. In this paper we present the Erasmian Language Model (ELM) a small context specific, 900 million parameter model, pre-trained and fine-tuned by and for Erasmus University Rotterdam. We show how the model performs adequately in a classroom context for essay writing, and how it achieves superior performance in subjects that are part of its context. This has implications for a wide range of institutions and organizations, showing that context specific language models may be a viable alternative for resource constrained, privacy sensitive use cases.

* 12 pages, 3 figures, 1 table

Via

Access Paper or Ask Questions