Zhiqiu Xu

Generative Modeling of Weights: Generalization or Memorization?

Jun 09, 2025
Via arXiv

Idiosyncrasies in Large Language Models

Feb 17, 2025
Via arXiv

Initializing Models with Larger Ones

Nov 30, 2023
Via arXiv

A Coefficient Makes SVRG Effective

Nov 09, 2023
Via arXiv

Dropout Reduces Underfitting

Mar 02, 2023
Via arXiv