Picture for Marius Mosbach

Marius Mosbach

Leveraging Routing Dynamics in Mixture-of-Experts Models for Efficient Language Adaptation

Add code
May 28, 2026
Viaarxiv icon

Forecasting Downstream Performance of LLMs With Proxy Metrics

Add code
May 18, 2026
Viaarxiv icon

The Illusion of Superposition? A Principled Analysis of Latent Thinking in Language Models

Add code
Apr 07, 2026
Viaarxiv icon

LLM2Vec-Gen: Generative Embeddings from Large Language Models

Add code
Mar 11, 2026
Viaarxiv icon

Operationalising the Superficial Alignment Hypothesis via Task Complexity

Add code
Feb 17, 2026
Viaarxiv icon

LatentLens: Revealing Highly Interpretable Visual Tokens in LLMs

Add code
Jan 31, 2026
Viaarxiv icon

CLaS-Bench: A Cross-Lingual Alignment and Steering Benchmark

Add code
Jan 13, 2026
Viaarxiv icon

Do Generalisation Results Generalise?

Add code
Dec 08, 2025
Figure 1 for Do Generalisation Results Generalise?
Figure 2 for Do Generalisation Results Generalise?
Figure 3 for Do Generalisation Results Generalise?
Figure 4 for Do Generalisation Results Generalise?
Viaarxiv icon

Value Drifts: Tracing Value Alignment During LLM Post-Training

Add code
Oct 30, 2025
Viaarxiv icon

LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Add code
Sep 03, 2025
Figure 1 for LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations
Figure 2 for LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations
Figure 3 for LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations
Figure 4 for LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations
Viaarxiv icon