Picture for Andrii Balashov

Andrii Balashov

Reinforcement Learning Fine-Tunes a Sparse Subnetwork in Large Language Models

Add code
Jul 23, 2025
Viaarxiv icon