Nikolaos Aletras

Boundary-targeted Membership Inference Attacks on Safety Classifiers

May 21, 2026
Via arXiv

Reasoning Dynamics and the Limits of Monitoring Modality Reliance in Vision-Language Models

Apr 16, 2026
Via arXiv

Fundamental Reasoning Paradigms Induce Out-of-Domain Generalization in Language Models

Feb 09, 2026
Via arXiv

An Empirical Study on Preference Tuning Generalization and Diversity Under Domain Shift

Jan 09, 2026
Via arXiv

Enhancing Linguistic Competence of Language Models through Pre-training with Language Learning Tasks

Jan 06, 2026
Via arXiv

Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?

Aug 27, 2025
Via arXiv

Progressive Depth Up-scaling via Optimal Transport

Aug 11, 2025
Via arXiv

Enhancing Logical Reasoning in Language Models via Symbolically-Guided Monte Carlo Process Supervision

May 26, 2025
Via arXiv

GreekBarBench: A Challenging Benchmark for Free-Text Legal Reasoning and Citations

May 22, 2025
Via arXiv

Compressing Language Models for Specialized Domains

Feb 25, 2025
Via arXiv