Lawrence McAfee

Stanford University

InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining

Oct 11, 2023
Boxin Wang, Wei Ping, Lawrence McAfee, Peng Xu, Bo Li, Mohammad Shoeybi, Bryan Catanzaro

Via arXiv

Retrieval meets Long Context Large Language Models

Oct 04, 2023
Peng Xu, Wei Ping, Xianchao Wu, Lawrence McAfee, Chen Zhu, Zihan Liu, Sandeep Subramanian, Evelina Bakhturina, Mohammad Shoeybi, Bryan Catanzaro

Via arXiv

Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study

Apr 13, 2023
Boxin Wang, Wei Ping, Peng Xu, Lawrence McAfee, Zihan Liu, Mohammad Shoeybi, Yi Dong, Oleksii Kuchaiev, Bo Li, Chaowei Xiao, Anima Anandkumar, Bryan Catanzaro

Via arXiv

Reducing Activation Recomputation in Large Transformer Models

May 10, 2022
Vijay Korthikanti, Jared Casper, Sangkug Lym, Lawrence McAfee, Michael Andersch, Mohammad Shoeybi, Bryan Catanzaro

Via arXiv

Utilizing Static Analysis and Code Generation to Accelerate Neural Networks

Jun 27, 2012
Lawrence McAfee, Kunle Olukotun

Via arXiv