Gabriel Pereyra

OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization

Dec 28, 2022
Srinivasan Iyer, Xi Victoria Lin, Ramakanth Pasunuru, Todor Mihaylov, Daniel Simig, Ping Yu, Kurt Shuster, Tianlu Wang, Qing Liu, Punit Singh Koura, Xian Li, Brian O'Horo, Gabriel Pereyra, Jeff Wang, Christopher Dewan, Asli Celikyilmaz, Luke Zettlemoyer, Ves Stoyanov

Via arXiv

Large scale distributed neural network training through online distillation

Apr 09, 2018
Rohan Anil, Gabriel Pereyra, Alexandre Passos, Robert Ormandi, George E. Dahl, Geoffrey E. Hinton

Via arXiv

Regularizing Neural Networks by Penalizing Confident Output Distributions

Jan 23, 2017
Gabriel Pereyra, George Tucker, Jan Chorowski, Łukasz Kaiser, Geoffrey Hinton

Via arXiv

Batch Normalized Recurrent Neural Networks

Oct 05, 2015
César Laurent, Gabriel Pereyra, Philémon Brakel, Ying Zhang, Yoshua Bengio

Via arXiv