Picture for Anastasiia Filippova

Anastasiia Filippova

Optimal Splitting of Language Models from Mixtures to Specialized Domains

Add code
Mar 19, 2026
Viaarxiv icon

Partial Parameter Updates for Efficient Distributed Training

Add code
Sep 26, 2025
Figure 1 for Partial Parameter Updates for Efficient Distributed Training
Figure 2 for Partial Parameter Updates for Efficient Distributed Training
Figure 3 for Partial Parameter Updates for Efficient Distributed Training
Figure 4 for Partial Parameter Updates for Efficient Distributed Training
Viaarxiv icon

Time-series attribution maps with regularized contrastive learning

Add code
Feb 17, 2025
Figure 1 for Time-series attribution maps with regularized contrastive learning
Figure 2 for Time-series attribution maps with regularized contrastive learning
Figure 3 for Time-series attribution maps with regularized contrastive learning
Figure 4 for Time-series attribution maps with regularized contrastive learning
Viaarxiv icon

No Need to Talk: Asynchronous Mixture of Language Models

Add code
Oct 04, 2024
Figure 1 for No Need to Talk: Asynchronous Mixture of Language Models
Figure 2 for No Need to Talk: Asynchronous Mixture of Language Models
Figure 3 for No Need to Talk: Asynchronous Mixture of Language Models
Figure 4 for No Need to Talk: Asynchronous Mixture of Language Models
Viaarxiv icon