Picture for Utku Evci

Utku Evci

Dima

Spark Transformer: Reactivating Sparsity in FFN and Attention

Add code
Jun 07, 2025
Viaarxiv icon

Gemma 3 Technical Report

Add code
Mar 25, 2025
Viaarxiv icon

Compression Scaling Laws:Unifying Sparsity and Quantization

Add code
Feb 23, 2025
Viaarxiv icon

Learning Parameter Sharing with Tensor Decompositions and Sparsity

Add code
Nov 14, 2024
Figure 1 for Learning Parameter Sharing with Tensor Decompositions and Sparsity
Figure 2 for Learning Parameter Sharing with Tensor Decompositions and Sparsity
Figure 3 for Learning Parameter Sharing with Tensor Decompositions and Sparsity
Figure 4 for Learning Parameter Sharing with Tensor Decompositions and Sparsity
Viaarxiv icon

Towards Optimal Adapter Placement for Efficient Transfer Learning

Add code
Oct 21, 2024
Figure 1 for Towards Optimal Adapter Placement for Efficient Transfer Learning
Figure 2 for Towards Optimal Adapter Placement for Efficient Transfer Learning
Figure 3 for Towards Optimal Adapter Placement for Efficient Transfer Learning
Figure 4 for Towards Optimal Adapter Placement for Efficient Transfer Learning
Viaarxiv icon

Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers

Add code
Feb 07, 2024
Figure 1 for Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
Figure 2 for Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
Figure 3 for Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
Figure 4 for Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
Viaarxiv icon

Scaling Laws for Sparsely-Connected Foundation Models

Add code
Sep 15, 2023
Viaarxiv icon

Dynamic Sparse Training with Structured Sparsity

Add code
May 03, 2023
Figure 1 for Dynamic Sparse Training with Structured Sparsity
Figure 2 for Dynamic Sparse Training with Structured Sparsity
Figure 3 for Dynamic Sparse Training with Structured Sparsity
Figure 4 for Dynamic Sparse Training with Structured Sparsity
Viaarxiv icon

JaxPruner: A concise library for sparsity research

Add code
May 02, 2023
Figure 1 for JaxPruner: A concise library for sparsity research
Figure 2 for JaxPruner: A concise library for sparsity research
Figure 3 for JaxPruner: A concise library for sparsity research
Viaarxiv icon

The Dormant Neuron Phenomenon in Deep Reinforcement Learning

Add code
Feb 24, 2023
Viaarxiv icon