Picture for Peter Schneider-Kamp

Peter Schneider-Kamp

SommBench: Assessing Sommelier Expertise of Language Models

Add code
Mar 12, 2026
Viaarxiv icon

FlexMoRE: A Flexible Mixture of Rank-heterogeneous Experts for Efficient Federatedly-trained Large Language Models

Add code
Feb 09, 2026
Viaarxiv icon

Training Language Models to Use Prolog as a Tool

Add code
Dec 08, 2025
Viaarxiv icon

Achieving Hilbert-Schmidt Independence Under Rényi Differential Privacy for Fair and Private Data Generation

Add code
Aug 29, 2025
Figure 1 for Achieving Hilbert-Schmidt Independence Under Rényi Differential Privacy for Fair and Private Data Generation
Figure 2 for Achieving Hilbert-Schmidt Independence Under Rényi Differential Privacy for Fair and Private Data Generation
Figure 3 for Achieving Hilbert-Schmidt Independence Under Rényi Differential Privacy for Fair and Private Data Generation
Figure 4 for Achieving Hilbert-Schmidt Independence Under Rényi Differential Privacy for Fair and Private Data Generation
Viaarxiv icon

Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models?

Add code
Feb 17, 2025
Figure 1 for Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models?
Figure 2 for Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models?
Figure 3 for Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models?
Figure 4 for Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models?
Viaarxiv icon

FlexDeMo: Decoupled Momentum Optimization for Fully and Hybrid Sharded Training

Add code
Feb 10, 2025
Figure 1 for FlexDeMo: Decoupled Momentum Optimization for Fully and Hybrid Sharded Training
Figure 2 for FlexDeMo: Decoupled Momentum Optimization for Fully and Hybrid Sharded Training
Figure 3 for FlexDeMo: Decoupled Momentum Optimization for Fully and Hybrid Sharded Training
Figure 4 for FlexDeMo: Decoupled Momentum Optimization for Fully and Hybrid Sharded Training
Viaarxiv icon

When are 1.58 bits enough? A Bottom-up Exploration of BitNet Quantization

Add code
Nov 08, 2024
Figure 1 for When are 1.58 bits enough? A Bottom-up Exploration of BitNet Quantization
Figure 2 for When are 1.58 bits enough? A Bottom-up Exploration of BitNet Quantization
Figure 3 for When are 1.58 bits enough? A Bottom-up Exploration of BitNet Quantization
Figure 4 for When are 1.58 bits enough? A Bottom-up Exploration of BitNet Quantization
Viaarxiv icon

Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks

Add code
Jun 19, 2024
Figure 1 for Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
Figure 2 for Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
Figure 3 for Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
Figure 4 for Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
Viaarxiv icon

SynthEval: A Framework for Detailed Utility and Privacy Evaluation of Tabular Synthetic Data

Add code
Apr 24, 2024
Viaarxiv icon

Sharing is CAIRing: Characterizing Principles and Assessing Properties of Universal Privacy Evaluation for Synthetic Tabular Data

Add code
Dec 19, 2023
Figure 1 for Sharing is CAIRing: Characterizing Principles and Assessing Properties of Universal Privacy Evaluation for Synthetic Tabular Data
Figure 2 for Sharing is CAIRing: Characterizing Principles and Assessing Properties of Universal Privacy Evaluation for Synthetic Tabular Data
Figure 3 for Sharing is CAIRing: Characterizing Principles and Assessing Properties of Universal Privacy Evaluation for Synthetic Tabular Data
Figure 4 for Sharing is CAIRing: Characterizing Principles and Assessing Properties of Universal Privacy Evaluation for Synthetic Tabular Data
Viaarxiv icon