Picture for Hongkang Yang

Hongkang Yang

Adaptive Preconditioners Trigger Loss Spikes in Adam

Add code
Jun 05, 2025
Viaarxiv icon

Scalable Complexity Control Facilitates Reasoning Ability of LLMs

Add code
May 29, 2025
Viaarxiv icon

MemOS: An Operating System for Memory-Augmented Generation (MAG) in Large Language Models

Add code
May 28, 2025
Viaarxiv icon

$\text{Memory}^3$: Language Modeling with Explicit Memory

Add code
Jul 01, 2024
Figure 1 for $\text{Memory}^3$: Language Modeling with Explicit Memory
Figure 2 for $\text{Memory}^3$: Language Modeling with Explicit Memory
Figure 3 for $\text{Memory}^3$: Language Modeling with Explicit Memory
Figure 4 for $\text{Memory}^3$: Language Modeling with Explicit Memory
Viaarxiv icon

A Mathematical Framework for Learning Probability Distributions

Add code
Dec 28, 2022
Viaarxiv icon

Generalization Error of GAN from the Discriminator's Perspective

Add code
Jul 08, 2021
Figure 1 for Generalization Error of GAN from the Discriminator's Perspective
Viaarxiv icon

Generalization and Memorization: The Bias Potential Model

Add code
Jan 06, 2021
Figure 1 for Generalization and Memorization: The Bias Potential Model
Figure 2 for Generalization and Memorization: The Bias Potential Model
Figure 3 for Generalization and Memorization: The Bias Potential Model
Figure 4 for Generalization and Memorization: The Bias Potential Model
Viaarxiv icon