Language Modelling


How Do Language Models Acquire Character-Level Information?

Feb 05, 2026

JSynFlow: Japanese Synthesised Flowchart Visual Question Answering Dataset built with Large Language Models

Feb 05, 2026

Can vision language models learn intuitive physics from interaction?

Feb 05, 2026

Hallucination-Resistant Security Planning with a Large Language Model

Feb 05, 2026

Once Correct, Still Wrong: Counterfactual Hallucination in Multilingual Vision-Language Models

Feb 05, 2026

Cross-Lingual Empirical Evaluation of Large Language Models for Arabic Medical Tasks

Feb 05, 2026

Transport and Merge: Cross-Architecture Merging for Large Language Models

Feb 05, 2026

SciDef: Automating Definition Extraction from Academic Literature with Large Language Models

Feb 05, 2026

DLM-Scope: Mechanistic Interpretability of Diffusion Language Models via Sparse Autoencoders

Feb 05, 2026

CASTLE: A Comprehensive Benchmark for Evaluating Student-Tailored Personalized Safety in Large Language Models

Feb 05, 2026