Darya Frolova

Rethinking Data: Towards Better Performing Domain-Specific Small Language Models

Mar 03, 2025

Attention Condensation via Sparsity Induced Regularized Training

Mar 03, 2025