Picture for Ryo Karakida

Ryo Karakida

Continual Learning in Modern Hopfield Networks with an Application to Diffusion Models

Add code
May 28, 2026
Viaarxiv icon

A Unified Framework for Critical Scaling of Inverse Temperature in Self-Attention

Add code
May 12, 2026
Viaarxiv icon

The Impact of Anisotropic Covariance Structure on the Training Dynamics and Generalization Error of Linear Networks

Add code
Jan 11, 2026
Viaarxiv icon

Recurrent Self-Attention Dynamics: An Energy-Agnostic Perspective from Jacobians

Add code
May 26, 2025
Figure 1 for Recurrent Self-Attention Dynamics: An Energy-Agnostic Perspective from Jacobians
Figure 2 for Recurrent Self-Attention Dynamics: An Energy-Agnostic Perspective from Jacobians
Figure 3 for Recurrent Self-Attention Dynamics: An Energy-Agnostic Perspective from Jacobians
Figure 4 for Recurrent Self-Attention Dynamics: An Energy-Agnostic Perspective from Jacobians
Viaarxiv icon

Local Loss Optimization in the Infinite Width: Stable Parameterization of Predictive Coding Networks and Target Propagation

Add code
Nov 04, 2024
Viaarxiv icon

Optimal Layer Selection for Latent Data Augmentation

Add code
Aug 24, 2024
Figure 1 for Optimal Layer Selection for Latent Data Augmentation
Figure 2 for Optimal Layer Selection for Latent Data Augmentation
Figure 3 for Optimal Layer Selection for Latent Data Augmentation
Figure 4 for Optimal Layer Selection for Latent Data Augmentation
Viaarxiv icon

Hierarchical Associative Memory, Parallelized MLP-Mixer, and Symmetry Breaking

Add code
Jun 18, 2024
Viaarxiv icon

Self-attention Networks Localize When QK-eigenspectrum Concentrates

Add code
Feb 03, 2024
Figure 1 for Self-attention Networks Localize When QK-eigenspectrum Concentrates
Figure 2 for Self-attention Networks Localize When QK-eigenspectrum Concentrates
Figure 3 for Self-attention Networks Localize When QK-eigenspectrum Concentrates
Figure 4 for Self-attention Networks Localize When QK-eigenspectrum Concentrates
Viaarxiv icon

On the Parameterization of Second-Order Optimization Effective Towards the Infinite Width

Add code
Dec 19, 2023
Viaarxiv icon

MLP-Mixer as a Wide and Sparse MLP

Add code
Jun 02, 2023
Viaarxiv icon