Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Mashup Learning: Faster Finetuning by Remixing Past Checkpoints

Mar 10, 2026

Sofia Maria Lo Cicero Vaina, Artem Chumachenko, Max Ryabinin

Share this with someone who'll enjoy it:

Abstract:Finetuning on domain-specific data is a well-established method for enhancing LLM performance on downstream tasks. Training on each dataset produces a new set of model weights, resulting in a multitude of checkpoints saved in-house or on open-source platforms. However, these training artifacts are rarely reused for subsequent experiments despite containing improved model abilities for potentially similar tasks. In this paper, we propose Mashup Learning, a simple method to leverage the outputs of prior training runs to enhance model adaptation to new tasks. Our procedure identifies the most relevant historical checkpoints for a target dataset, aggregates them with model merging, and uses the result as an improved initialization for training. Across 8 standard LLM benchmarks, four models, and two collections of source checkpoints, Mashup Learning consistently improves average downstream accuracy by 0.5-5 percentage points over training from scratch. It also accelerates convergence, requiring 41-46% fewer training steps and up to 37% less total wall-clock time to match from-scratch accuracy, including all selection and merging overhead.

* 18 pages, 7 figures. Code: https://github.com/2son1a/mashup-learning

View paper on

Share this with someone who'll enjoy it:

Title:Mashup Learning: Faster Finetuning by Remixing Past Checkpoints

Paper and Code