Picture for Serry Sibaee

Serry Sibaee

SARD: A Large-Scale Synthetic Arabic OCR Dataset for Book-Style Text Recognition

Add code
May 30, 2025
Viaarxiv icon

GATE: General Arabic Text Embedding for Enhanced Semantic Textual Similarity with Matryoshka Representation Learning and Hybrid Loss Training

Add code
May 30, 2025
Viaarxiv icon

Pearl: A Multimodal Culturally-Aware Arabic Instruction Dataset

Add code
May 28, 2025
Viaarxiv icon

Advancing Arabic Reverse Dictionary Systems: A Transformer-Based Approach with Dataset Construction Guidelines

Add code
Apr 30, 2025
Viaarxiv icon

LLMs as Compiler for Arabic Programming Language

Add code
Mar 24, 2024
Viaarxiv icon

ArabianGPT: Native Arabic GPT-based Large Language Model

Add code
Feb 26, 2024
Viaarxiv icon

Prediction of Arabic Legal Rulings using Large Language Models

Add code
Oct 16, 2023
Viaarxiv icon