Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Meng Zhu

AdamX: An Adam improvement algorithm based on a novel exponential decay mechanism for the second-order moment estimate

Nov 19, 2025

Meng Zhu, Quan Xiao, Weidong Min

Figure 1 for AdamX: An Adam improvement algorithm based on a novel exponential decay mechanism for the second-order moment estimate

Figure 2 for AdamX: An Adam improvement algorithm based on a novel exponential decay mechanism for the second-order moment estimate

Figure 3 for AdamX: An Adam improvement algorithm based on a novel exponential decay mechanism for the second-order moment estimate

Figure 4 for AdamX: An Adam improvement algorithm based on a novel exponential decay mechanism for the second-order moment estimate

Abstract:Since the 21st century, artificial intelligence has been leading a new round of industrial revolution. Under the training framework, the optimization algorithm aims to stably converge high-dimensional optimization to local and even global minima. Entering the era of large language models, although the scale of model parameters and data has increased, Adam remains the mainstream optimization algorithm. However, compared with stochastic gradient descent (SGD) based optimization algorithms, Adam is more likely to converge to non-flat minima. To address this issue, the AdamX algorithm is proposed. Its core innovation lies in the proposition of a novel type of second-order moment estimation exponential decay rate, which gradually weakens the learning step correction strength as training progresses, and degrades to SGD in the stable training period, thereby improving the stability of training in the stable period and possibly enhancing generalization ability. Experimental results show that our second-order moment estimation exponential decay rate is better than the current second-order moment estimation exponential decay rate, and AdamX can stably outperform Adam and its variants in terms of performance. Our code is open-sourced at https://github.com/mengzhu0308/AdamX.

* 25 pages, 6 figures, 12 tables. Version 2: (1) Clarified i.i.d. assumption on gradient and noise components (implicitly used in v1). See Hypothesis 1 for details. (2) Refined abstract terminology: explicitly states degradation to momentum SGD. The theoretical results and conclusions remain unchanged

Via

Access Paper or Ask Questions

Stain-free Detection of Embryo Polarization using Deep Learning

Nov 08, 2021

Cheng Shen, Adiyant Lamba, Meng Zhu, Ray Zhang, Changhuei Yang, Magdalena Zernicka Goetz

Figure 1 for Stain-free Detection of Embryo Polarization using Deep Learning

Figure 2 for Stain-free Detection of Embryo Polarization using Deep Learning

Figure 3 for Stain-free Detection of Embryo Polarization using Deep Learning

Figure 4 for Stain-free Detection of Embryo Polarization using Deep Learning

Abstract:Polarization of the mammalian embryo at the right developmental time is critical for its development to term and would be valuable in assessing the potential of human embryos. However, tracking polarization requires invasive fluorescence staining, impermissible in the in vitro fertilization clinic. Here, we report the use of artificial intelligence to detect polarization from unstained time-lapse movies of mouse embryos. We assembled a dataset of bright-field movie frames from 8-cell-stage embryos, side-by-side with corresponding images of fluorescent markers of cell polarization. We then used an ensemble learning model to detect whether any bright-field frame showed an embryo before or after onset of polarization. Our resulting model has an accuracy of 85% for detecting polarization, significantly outperforming human volunteers trained on the same data (61% accuracy). We discovered that our self-learning model focuses upon the angle between cells as one known cue for compaction, which precedes polarization, but it outperforms the use of this cue alone. By compressing three-dimensional time-lapsed image data into two-dimensions, we are able to reduce data to an easily manageable size for deep learning processing. In conclusion, we describe a method for detecting a key developmental feature of embryo development that avoids clinically impermissible fluorescence staining.

Via

Access Paper or Ask Questions