Keitaro Sakamoto

Locking Pretrained Weights via Deep Low-Rank Residual Distillation

May 11, 2026
Via arXiv

Benign or Not-Benign Overfitting in Token Selection of Attention Mechanism

Sep 26, 2024
Via arXiv

End-to-End Training Induces Information Bottleneck through Layer-Role Differentiation: A Comparative Analysis with Layer-wise Training

Feb 14, 2024
Via arXiv

Analyzing Lottery Ticket Hypothesis from PAC-Bayesian Theory Perspective

May 15, 2022
Via arXiv