Alert button
Picture for Edward J. Hu

Edward J. Hu

Alert button

Amortizing intractable inference in large language models

Add code
Bookmark button
Alert button
Oct 06, 2023
Edward J. Hu, Moksh Jain, Eric Elmoznino, Younesse Kaddar, Guillaume Lajoie, Yoshua Bengio, Nikolay Malkin

Viaarxiv icon

Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer

Add code
Bookmark button
Alert button
Mar 28, 2022
Greg Yang, Edward J. Hu, Igor Babuschkin, Szymon Sidor, Xiaodong Liu, David Farhi, Nick Ryder, Jakub Pachocki, Weizhu Chen, Jianfeng Gao

Figure 1 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Figure 2 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Figure 3 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Figure 4 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Viaarxiv icon

GFlowNet Foundations

Add code
Bookmark button
Alert button
Nov 17, 2021
Yoshua Bengio, Tristan Deleu, Edward J. Hu, Salem Lahlou, Mo Tiwari, Emmanuel Bengio

Figure 1 for GFlowNet Foundations
Figure 2 for GFlowNet Foundations
Figure 3 for GFlowNet Foundations
Viaarxiv icon

LoRA: Low-Rank Adaptation of Large Language Models

Add code
Bookmark button
Alert button
Jun 17, 2021
Edward J. Hu, Yelong Shen, Phillip Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Shean Wang, Weizhu Chen

Figure 1 for LoRA: Low-Rank Adaptation of Large Language Models
Figure 2 for LoRA: Low-Rank Adaptation of Large Language Models
Figure 3 for LoRA: Low-Rank Adaptation of Large Language Models
Figure 4 for LoRA: Low-Rank Adaptation of Large Language Models
Viaarxiv icon

Feature Learning in Infinite-Width Neural Networks

Add code
Bookmark button
Alert button
Nov 30, 2020
Greg Yang, Edward J. Hu

Figure 1 for Feature Learning in Infinite-Width Neural Networks
Figure 2 for Feature Learning in Infinite-Width Neural Networks
Figure 3 for Feature Learning in Infinite-Width Neural Networks
Figure 4 for Feature Learning in Infinite-Width Neural Networks
Viaarxiv icon