Picture for Björn Deiseroth

Björn Deiseroth

Bounding Hallucinations: Information-Theoretic Guarantees for RAG Systems via Merlin-Arthur Protocols

Add code
Dec 12, 2025
Viaarxiv icon

LIME: Making LLM Data More Efficient with Linguistic Metadata Embeddings

Add code
Dec 08, 2025
Viaarxiv icon

Measuring and Guiding Monosemanticity

Add code
Jun 24, 2025
Viaarxiv icon

Aleph-Alpha-GermanWeb: Improving German-language LLM pre-training with model-based data curation and synthetic data generation

Add code
Apr 24, 2025
Viaarxiv icon

Hierarchical Autoregressive Transformers: Combining Byte-~and Word-Level Processing for Robust, Adaptable Language Models

Add code
Jan 17, 2025
Figure 1 for Hierarchical Autoregressive Transformers: Combining Byte-~and Word-Level Processing for Robust, Adaptable Language Models
Figure 2 for Hierarchical Autoregressive Transformers: Combining Byte-~and Word-Level Processing for Robust, Adaptable Language Models
Figure 3 for Hierarchical Autoregressive Transformers: Combining Byte-~and Word-Level Processing for Robust, Adaptable Language Models
Figure 4 for Hierarchical Autoregressive Transformers: Combining Byte-~and Word-Level Processing for Robust, Adaptable Language Models
Viaarxiv icon

SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs

Add code
Nov 11, 2024
Viaarxiv icon

u-$μ$P: The Unit-Scaled Maximal Update Parametrization

Add code
Jul 24, 2024
Viaarxiv icon

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Add code
Jun 27, 2024
Figure 1 for T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings
Figure 2 for T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings
Figure 3 for T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings
Figure 4 for T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings
Viaarxiv icon

Mechanistic Design and Scaling of Hybrid Architectures

Add code
Mar 26, 2024
Figure 1 for Mechanistic Design and Scaling of Hybrid Architectures
Figure 2 for Mechanistic Design and Scaling of Hybrid Architectures
Figure 3 for Mechanistic Design and Scaling of Hybrid Architectures
Figure 4 for Mechanistic Design and Scaling of Hybrid Architectures
Viaarxiv icon

Divergent Token Metrics: Measuring degradation to prune away LLM components -- and optimize quantization

Add code
Nov 13, 2023
Viaarxiv icon