Picture for Masaki Kawamura

Masaki Kawamura

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

Add code
Aug 26, 2025
Viaarxiv icon

Rewriting Pre-Training Data Boosts LLM Performance in Math and Code

Add code
May 05, 2025
Viaarxiv icon