Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Artan Sheshmani

Yau's Affine Normal Descent: Algorithmic Framework and Convergence Analysis

Mar 30, 2026

Yi-Shuai Niu, Artan Sheshmani, Shing-Tung Yau

Abstract:We propose Yau's Affine Normal Descent (YAND), a geometric framework for smooth unconstrained optimization in which search directions are defined by the equi-affine normal of level-set hypersurfaces. The resulting directions are invariant under volume-preserving affine transformations and intrinsically adapt to anisotropic curvature. Using the analytic representation of the affine normal from affine differential geometry, we establish its equivalence with the classical slice-centroid construction under convexity. For strictly convex quadratic objectives, affine-normal directions are collinear with Newton directions, implying one-step convergence under exact line search. For general smooth (possibly nonconvex) objectives, we characterize precisely when affine-normal directions yield strict descent and develop a line-search-based YAND. We establish global convergence under standard smoothness assumptions, linear convergence under strong convexity and Polyak-Lojasiewicz conditions, and quadratic local convergence near nondegenerate minimizers. We further show that affine-normal directions are robust under affine scalings, remaining insensitive to arbitrarily ill-conditioned transformations. Numerical experiments illustrate the geometric behavior of the method and its robustness under strong anisotropic scaling.

* 55 pages, 25 figures

Via

Access Paper or Ask Questions

Categorical Representation Learning and RG flow operators for algorithmic classifiers

Mar 15, 2022

Artan Sheshmani, Yizhuang You, Wenbo Fu, Ahmadreza Azizi

Figure 1 for Categorical Representation Learning and RG flow operators for algorithmic classifiers

Figure 2 for Categorical Representation Learning and RG flow operators for algorithmic classifiers

Figure 3 for Categorical Representation Learning and RG flow operators for algorithmic classifiers

Figure 4 for Categorical Representation Learning and RG flow operators for algorithmic classifiers

Abstract:Following the earlier formalism of the categorical representation learning (arXiv:2103.14770) by the first two authors, we discuss the construction of the "RG-flow based categorifier". Borrowing ideas from theory of renormalization group flows (RG) in quantum field theory, holographic duality, and hyperbolic geometry, and mixing them with neural ODE's, we construct a new algorithmic natural language processing (NLP) architecture, called the RG-flow categorifier or for short the RG categorifier, which is capable of data classification and generation in all layers. We apply our algorithmic platform to biomedical data sets and show its performance in the field of sequence-to-function mapping. In particular we apply the RG categorifier to particular genomic sequences of flu viruses and show how our technology is capable of extracting the information from given genomic sequences, find their hidden symmetries and dominant features, classify them and use the trained data to make stochastic prediction of new plausible generated sequences associated with new set of viruses which could avoid the human immune system. The content of the current article is part of the recent US patent application submitted by first two authors (U.S. Patent Application No.: 63/313.504).

* 31 pages, comments are very welcome

Via

Access Paper or Ask Questions

Categorical Representation Learning: Morphism is All You Need

Mar 30, 2021

Artan Sheshmani, Yizhuang You

Figure 1 for Categorical Representation Learning: Morphism is All You Need

Figure 2 for Categorical Representation Learning: Morphism is All You Need

Figure 3 for Categorical Representation Learning: Morphism is All You Need

Figure 4 for Categorical Representation Learning: Morphism is All You Need

Abstract:We provide a construction for categorical representation learning and introduce the foundations of "$\textit{categorifier}$". The central theme in representation learning is the idea of $\textbf{everything to vector}$. Every object in a dataset $\mathcal{S}$ can be represented as a vector in $\mathbb{R}^n$ by an $\textit{encoding map}$ $E: \mathcal{O}bj(\mathcal{S})\to\mathbb{R}^n$. More importantly, every morphism can be represented as a matrix $E: \mathcal{H}om(\mathcal{S})\to\mathbb{R}^{n}_{n}$. The encoding map $E$ is generally modeled by a $\textit{deep neural network}$. The goal of representation learning is to design appropriate tasks on the dataset to train the encoding map (assuming that an encoding is optimal if it universally optimizes the performance on various tasks). However, the latter is still a $\textit{set-theoretic}$ approach. The goal of the current article is to promote the representation learning to a new level via a $\textit{category-theoretic}$ approach. As a proof of concept, we provide an example of a text translator equipped with our technology, showing that our categorical learning model outperforms the current deep learning models by 17 times. The content of the current article is part of the recent US patent proposal (patent application number: 63110906).

* Fixed some typos. 16 pages. Comments are welcome

Via

Access Paper or Ask Questions