ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation

Add code
Mar 08, 2025
Figure 1 for ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation
Figure 2 for ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation
Figure 3 for ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation
Figure 4 for ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: