Picture for Oliver Sieberling

Oliver Sieberling

MesaNet: Sequence Modeling by Locally Optimal Test-Time Training

Add code
Jun 05, 2025
Viaarxiv icon

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Add code
May 20, 2025
Viaarxiv icon

DarwinLM: Evolutionary Structured Pruning of Large Language Models

Add code
Feb 11, 2025
Viaarxiv icon

EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search

Add code
Oct 18, 2024
Viaarxiv icon

Plus Strategies are Exponentially Slower for Planted Optima of Random Height

Add code
Apr 15, 2024
Figure 1 for Plus Strategies are Exponentially Slower for Planted Optima of Random Height
Figure 2 for Plus Strategies are Exponentially Slower for Planted Optima of Random Height
Figure 3 for Plus Strategies are Exponentially Slower for Planted Optima of Random Height
Figure 4 for Plus Strategies are Exponentially Slower for Planted Optima of Random Height
Viaarxiv icon

Hardest Monotone Functions for Evolutionary Algorithms

Add code
Nov 13, 2023
Viaarxiv icon