Yi Tay

Recitation-Augmented Language Models

Oct 04, 2022
Zhiqing Sun, Xuezhi Wang, Yi Tay, Yiming Yang, Denny Zhou

Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?

Jul 21, 2022
Yi Tay, Mostafa Dehghani, Samira Abnar, Hyung Won Chung, William Fedus, Jinfeng Rao, Sharan Narang, Vinh Q. Tran, Dani Yogatama, Donald Metzler

Confident Adaptive Language Modeling

Jul 14, 2022
Tal Schuster, Adam Fisch, Jai Gupta, Mostafa Dehghani, Dara Bahri, Vinh Q. Tran, Yi Tay, Donald Metzler

Emergent Abilities of Large Language Models

Jun 15, 2022
Jason Wei, Yi Tay, Rishi Bommasani, Colin Raffel, Barret Zoph, Sebastian Borgeaud, Dani Yogatama, Maarten Bosma, Denny Zhou, Donald Metzler, Ed H. Chi, Tatsunori Hashimoto, Oriol Vinyals, Percy Liang, Jeff Dean, William Fedus

Unifying Language Learning Paradigms

May 10, 2022
Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Xavier Garcia, Dara Bahri, Tal Schuster, Huaixiu Steven Zheng, Neil Houlsby, Donald Metzler

ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking Inference

Apr 25, 2022
Kai Hui, Honglei Zhuang, Tao Chen, Zhen Qin, Jing Lu, Dara Bahri, Ji Ma, Jai Prakash Gupta, Cicero Nogueira dos Santos, Yi Tay, Don Metzler

PaLM: Scaling Language Modeling with Pathways

Apr 19, 2022
Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, Michael Isard, Guy Gur-Ari, Pengcheng Yin, Toju Duke, Anselm Levskaya, Sanjay Ghemawat, Sunipa Dev, Henryk Michalewski, Xavier Garcia, Vedant Misra, Kevin Robinson, Liam Fedus, Denny Zhou, Daphne Ippolito, David Luan, Hyeontaek Lim, Barret Zoph, Alexander Spiridonov, Ryan Sepassi, David Dohan, Shivani Agrawal, Mark Omernick, Andrew M. Dai, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Rewon Child, Oleksandr Polozov, Katherine Lee, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Mark Diaz, Orhan Firat, Michele Catasta, Jason Wei, Kathy Meier-Hellstern, Douglas Eck, Jeff Dean, Slav Petrov, Noah Fiedel

HyperPrompt: Prompt-based Task-Conditioning of Transformers

Mar 01, 2022
Yun He, Huaixiu Steven Zheng, Yi Tay, Jai Gupta, Yu Du, Vamsi Aribandi, Zhe Zhao, YaGuang Li, Zhao Chen, Donald Metzler, Heng-Tze Cheng, Ed H. Chi

A New Generation of Perspective API: Efficient Multilingual Character-level Transformers

Feb 22, 2022
Alyssa Lees, Vinh Q. Tran, Yi Tay, Jeffrey Sorensen, Jai Gupta, Donald Metzler, Lucy Vasserman
