ElastiFormer: Learned Redundancy Reduction in Transformer via Self-Distillation

Add code
Nov 22, 2024
Figure 1 for ElastiFormer: Learned Redundancy Reduction in Transformer via Self-Distillation
Figure 2 for ElastiFormer: Learned Redundancy Reduction in Transformer via Self-Distillation
Figure 3 for ElastiFormer: Learned Redundancy Reduction in Transformer via Self-Distillation
Figure 4 for ElastiFormer: Learned Redundancy Reduction in Transformer via Self-Distillation

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: