Picture for Hammam Abdelwahab

Hammam Abdelwahab

Textual Data Bias Detection and Mitigation -- An Extensible Pipeline with Experimental Evaluation

Add code
Dec 12, 2025
Viaarxiv icon

Data Processing for the OpenGPT-X Model Family

Add code
Oct 11, 2024
Figure 1 for Data Processing for the OpenGPT-X Model Family
Figure 2 for Data Processing for the OpenGPT-X Model Family
Figure 3 for Data Processing for the OpenGPT-X Model Family
Figure 4 for Data Processing for the OpenGPT-X Model Family
Viaarxiv icon

Tokenizer Choice For LLM Training: Negligible or Crucial?

Add code
Oct 18, 2023
Figure 1 for Tokenizer Choice For LLM Training: Negligible or Crucial?
Figure 2 for Tokenizer Choice For LLM Training: Negligible or Crucial?
Figure 3 for Tokenizer Choice For LLM Training: Negligible or Crucial?
Figure 4 for Tokenizer Choice For LLM Training: Negligible or Crucial?
Viaarxiv icon