Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hyunmo Kang

Learning curves theory for hierarchically compositional data with power-law distributed features

May 11, 2025

Francesco Cagnetta, Hyunmo Kang, Matthieu Wyart

Abstract:Recent theories suggest that Neural Scaling Laws arise whenever the task is linearly decomposed into power-law distributed units. Alternatively, scaling laws also emerge when data exhibit a hierarchically compositional structure, as is thought to occur in language and images. To unify these views, we consider classification and next-token prediction tasks based on probabilistic context-free grammars -- probabilistic models that generate data via a hierarchy of production rules. For classification, we show that having power-law distributed production rules results in a power-law learning curve with an exponent depending on the rules' distribution and a large multiplicative constant that depends on the hierarchical structure. By contrast, for next-token prediction, the distribution of production rules controls the local details of the learning curve, but not the exponent describing the large-scale behaviour.

Via

Access Paper or Ask Questions

Spectral Analysis of Representational Similarity with Limited Neurons

Feb 27, 2025

Hyunmo Kang, Abdulkadir Canatar, SueYeon Chung

Figure 1 for Spectral Analysis of Representational Similarity with Limited Neurons

Figure 2 for Spectral Analysis of Representational Similarity with Limited Neurons

Figure 3 for Spectral Analysis of Representational Similarity with Limited Neurons

Figure 4 for Spectral Analysis of Representational Similarity with Limited Neurons

Abstract:Measuring representational similarity between neural recordings and computational models is challenging due to constraints on the number of neurons that can be recorded simultaneously. In this work, we investigate how such limitations affect similarity measures, focusing on Canonical Correlation Analysis (CCA) and Centered Kernel Alignment (CKA). Leveraging tools from Random Matrix Theory, we develop a predictive spectral framework for these measures and demonstrate that finite neuron sampling systematically underestimates similarity due to eigenvector delocalization. To overcome this, we introduce a denoising method to infer population-level similarity, enabling accurate analysis even with small neuron samples. Our theory is validated on synthetic and real datasets, offering practical strategies for interpreting neural data under finite sampling constraints.

Via

Access Paper or Ask Questions