Alert button
Picture for Jason Wei

Jason Wei

Alert button

FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation

Oct 05, 2023
Tu Vu, Mohit Iyyer, Xuezhi Wang, Noah Constant, Jerry Wei, Jason Wei, Chris Tar, Yun-Hsuan Sung, Denny Zhou, Quoc Le, Thang Luong

Figure 1 for FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation
Figure 2 for FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation
Figure 3 for FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation
Figure 4 for FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation
Viaarxiv icon

Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts

May 24, 2023
Sheng Shen, Le Hou, Yanqi Zhou, Nan Du, Shayne Longpre, Jason Wei, Hyung Won Chung, Barret Zoph, William Fedus, Xinyun Chen, Tu Vu, Yuexin Wu, Wuyang Chen, Albert Webson, Yunxuan Li, Vincent Zhao, Hongkun Yu, Kurt Keutzer, Trevor Darrell, Denny Zhou

Figure 1 for Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts
Figure 2 for Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts
Figure 3 for Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts
Figure 4 for Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts
Viaarxiv icon

A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity

May 22, 2023
Shayne Longpre, Gregory Yauney, Emily Reif, Katherine Lee, Adam Roberts, Barret Zoph, Denny Zhou, Jason Wei, Kevin Robinson, David Mimno, Daphne Ippolito

Figure 1 for A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
Figure 2 for A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
Figure 3 for A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
Figure 4 for A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
Viaarxiv icon

Larger language models do in-context learning differently

Mar 08, 2023
Jerry Wei, Jason Wei, Yi Tay, Dustin Tran, Albert Webson, Yifeng Lu, Xinyun Chen, Hanxiao Liu, Da Huang, Denny Zhou, Tengyu Ma

Figure 1 for Larger language models do in-context learning differently
Figure 2 for Larger language models do in-context learning differently
Figure 3 for Larger language models do in-context learning differently
Figure 4 for Larger language models do in-context learning differently
Viaarxiv icon

Foundation Models for Decision Making: Problems, Methods, and Opportunities

Mar 07, 2023
Sherry Yang, Ofir Nachum, Yilun Du, Jason Wei, Pieter Abbeel, Dale Schuurmans

Figure 1 for Foundation Models for Decision Making: Problems, Methods, and Opportunities
Figure 2 for Foundation Models for Decision Making: Problems, Methods, and Opportunities
Figure 3 for Foundation Models for Decision Making: Problems, Methods, and Opportunities
Figure 4 for Foundation Models for Decision Making: Problems, Methods, and Opportunities
Viaarxiv icon

The Flan Collection: Designing Data and Methods for Effective Instruction Tuning

Feb 14, 2023
Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung, Yi Tay, Denny Zhou, Quoc V. Le, Barret Zoph, Jason Wei, Adam Roberts

Figure 1 for The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
Figure 2 for The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
Figure 3 for The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
Figure 4 for The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
Viaarxiv icon

Large Language Models Encode Clinical Knowledge

Dec 26, 2022
Karan Singhal, Shekoofeh Azizi, Tao Tu, S. Sara Mahdavi, Jason Wei, Hyung Won Chung, Nathan Scales, Ajay Tanwani, Heather Cole-Lewis, Stephen Pfohl, Perry Payne, Martin Seneviratne, Paul Gamble, Chris Kelly, Nathaneal Scharli, Aakanksha Chowdhery, Philip Mansfield, Blaise Aguera y Arcas, Dale Webster, Greg S. Corrado, Yossi Matias, Katherine Chou, Juraj Gottweis, Nenad Tomasev, Yun Liu, Alvin Rajkomar, Joelle Barral, Christopher Semturs, Alan Karthikesalingam, Vivek Natarajan

Figure 1 for Large Language Models Encode Clinical Knowledge
Figure 2 for Large Language Models Encode Clinical Knowledge
Figure 3 for Large Language Models Encode Clinical Knowledge
Figure 4 for Large Language Models Encode Clinical Knowledge
Viaarxiv icon

Inverse scaling can become U-shaped

Nov 14, 2022
Jason Wei, Yi Tay, Quoc V. Le

Figure 1 for Inverse scaling can become U-shaped
Figure 2 for Inverse scaling can become U-shaped
Figure 3 for Inverse scaling can become U-shaped
Figure 4 for Inverse scaling can become U-shaped
Viaarxiv icon