Picture for Hanshan Zhang

Hanshan Zhang

Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining

Add code
Mar 06, 2025
Viaarxiv icon

Energy-Based Preference Model Offers Better Offline Alignment than the Bradley-Terry Preference Model

Add code
Dec 18, 2024
Viaarxiv icon

Uncertainty Sentence Sampling by Virtual Adversarial Perturbation

Add code
Oct 27, 2022
Figure 1 for Uncertainty Sentence Sampling by Virtual Adversarial Perturbation
Figure 2 for Uncertainty Sentence Sampling by Virtual Adversarial Perturbation
Figure 3 for Uncertainty Sentence Sampling by Virtual Adversarial Perturbation
Figure 4 for Uncertainty Sentence Sampling by Virtual Adversarial Perturbation
Viaarxiv icon