Picture for Madalina Ciobanu

Madalina Ciobanu

AdaDPO: Self-Adaptive Direct Preference Optimization with Balanced Gradient Updates

Add code
May 27, 2026
Viaarxiv icon

Interpretability-Guided Layer Selection over Subspace Projection: SAEs as Stethoscopes, Not Scalpels, for Raw Task Vector Model Editing

Add code
May 27, 2026
Viaarxiv icon

Application-Driven Pedagogical Knowledge Optimization of Open-Source LLMs via Reinforcement Learning and Supervised Fine-Tuning

Add code
Apr 07, 2026
Viaarxiv icon

State-of-the-Art Arabic Language Modeling with Sparse MoE Fine-Tuning and Chain-of-Thought Distillation

Add code
Apr 07, 2026
Viaarxiv icon

OpenMedLM: Prompt engineering can out-perform fine-tuning in medical question-answering with open-source large language models

Add code
Feb 29, 2024
Viaarxiv icon