Alert button
Picture for Yanqi Zhou

Yanqi Zhou

Alert button

FLuRKA: Fast fused Low-Rank & Kernel Attention

Add code
Bookmark button
Alert button
Jun 27, 2023
Ahan Gupta, Yueming Yuan, Yanqi Zhou, Charith Mendis

Figure 1 for FLuRKA: Fast fused Low-Rank & Kernel Attention
Figure 2 for FLuRKA: Fast fused Low-Rank & Kernel Attention
Figure 3 for FLuRKA: Fast fused Low-Rank & Kernel Attention
Figure 4 for FLuRKA: Fast fused Low-Rank & Kernel Attention
Viaarxiv icon

Brainformers: Trading Simplicity for Efficiency

Add code
Bookmark button
Alert button
May 29, 2023
Yanqi Zhou, Nan Du, Yanping Huang, Daiyi Peng, Chang Lan, Da Huang, Siamak Shakeri, David So, Andrew Dai, Yifeng Lu, Zhifeng Chen, Quoc Le, Claire Cui, James Laundon, Jeff Dean

Figure 1 for Brainformers: Trading Simplicity for Efficiency
Figure 2 for Brainformers: Trading Simplicity for Efficiency
Figure 3 for Brainformers: Trading Simplicity for Efficiency
Figure 4 for Brainformers: Trading Simplicity for Efficiency
Viaarxiv icon

Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts

Add code
Bookmark button
Alert button
May 24, 2023
Sheng Shen, Le Hou, Yanqi Zhou, Nan Du, Shayne Longpre, Jason Wei, Hyung Won Chung, Barret Zoph, William Fedus, Xinyun Chen, Tu Vu, Yuexin Wu, Wuyang Chen, Albert Webson, Yunxuan Li, Vincent Zhao, Hongkun Yu, Kurt Keutzer, Trevor Darrell, Denny Zhou

Figure 1 for Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts
Figure 2 for Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts
Figure 3 for Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts
Figure 4 for Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts
Viaarxiv icon

GiPH: Generalizable Placement Learning for Adaptive Heterogeneous Computing

Add code
Bookmark button
Alert button
May 23, 2023
Yi Hu, Chaoran Zhang, Edward Andert, Harshul Singh, Aviral Shrivastava, James Laudon, Yanqi Zhou, Bob Iannucci, Carlee Joe-Wong

Figure 1 for GiPH: Generalizable Placement Learning for Adaptive Heterogeneous Computing
Figure 2 for GiPH: Generalizable Placement Learning for Adaptive Heterogeneous Computing
Figure 3 for GiPH: Generalizable Placement Learning for Adaptive Heterogeneous Computing
Figure 4 for GiPH: Generalizable Placement Learning for Adaptive Heterogeneous Computing
Viaarxiv icon

Learning Large Graph Property Prediction via Graph Segment Training

Add code
Bookmark button
Alert button
May 21, 2023
Kaidi Cao, Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Dustin Zelle, Yanqi Zhou, Charith Mendis, Jure Leskovec, Bryan Perozzi

Figure 1 for Learning Large Graph Property Prediction via Graph Segment Training
Figure 2 for Learning Large Graph Property Prediction via Graph Segment Training
Figure 3 for Learning Large Graph Property Prediction via Graph Segment Training
Figure 4 for Learning Large Graph Property Prediction via Graph Segment Training
Viaarxiv icon

Lifelong Language Pretraining with Distribution-Specialized Experts

Add code
Bookmark button
Alert button
May 20, 2023
Wuyang Chen, Yanqi Zhou, Nan Du, Yanping Huang, James Laudon, Zhifeng Chen, Claire Cu

Figure 1 for Lifelong Language Pretraining with Distribution-Specialized Experts
Figure 2 for Lifelong Language Pretraining with Distribution-Specialized Experts
Figure 3 for Lifelong Language Pretraining with Distribution-Specialized Experts
Figure 4 for Lifelong Language Pretraining with Distribution-Specialized Experts
Viaarxiv icon

Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference

Add code
Bookmark button
Alert button
Apr 11, 2023
Tao Lei, Junwen Bai, Siddhartha Brahma, Joshua Ainslie, Kenton Lee, Yanqi Zhou, Nan Du, Vincent Y. Zhao, Yuexin Wu, Bo Li, Yu Zhang, Ming-Wei Chang

Figure 1 for Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference
Figure 2 for Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference
Figure 3 for Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference
Figure 4 for Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference
Viaarxiv icon

Toward Edge-Efficient Dense Predictions with Synergistic Multi-Task Neural Architecture Search

Add code
Bookmark button
Alert button
Oct 04, 2022
Thanh Vu, Yanqi Zhou, Chunfeng Wen, Yueqi Li, Jan-Michael Frahm

Figure 1 for Toward Edge-Efficient Dense Predictions with Synergistic Multi-Task Neural Architecture Search
Figure 2 for Toward Edge-Efficient Dense Predictions with Synergistic Multi-Task Neural Architecture Search
Figure 3 for Toward Edge-Efficient Dense Predictions with Synergistic Multi-Task Neural Architecture Search
Figure 4 for Toward Edge-Efficient Dense Predictions with Synergistic Multi-Task Neural Architecture Search
Viaarxiv icon

Mixture-of-Experts with Expert Choice Routing

Add code
Bookmark button
Alert button
Feb 18, 2022
Yanqi Zhou, Tao Lei, Hanxiao Liu, Nan Du, Yanping Huang, Vincent Zhao, Andrew Dai, Zhifeng Chen, Quoc Le, James Laudon

Figure 1 for Mixture-of-Experts with Expert Choice Routing
Figure 2 for Mixture-of-Experts with Expert Choice Routing
Figure 3 for Mixture-of-Experts with Expert Choice Routing
Figure 4 for Mixture-of-Experts with Expert Choice Routing
Viaarxiv icon

LaMDA: Language Models for Dialog Applications

Add code
Bookmark button
Alert button
Feb 10, 2022
Romal Thoppilan, Daniel De Freitas, Jamie Hall, Noam Shazeer, Apoorv Kulshreshtha, Heng-Tze Cheng, Alicia Jin, Taylor Bos, Leslie Baker, Yu Du, YaGuang Li, Hongrae Lee, Huaixiu Steven Zheng, Amin Ghafouri, Marcelo Menegali, Yanping Huang, Maxim Krikun, Dmitry Lepikhin, James Qin, Dehao Chen, Yuanzhong Xu, Zhifeng Chen, Adam Roberts, Maarten Bosma, Vincent Zhao, Yanqi Zhou, Chung-Ching Chang, Igor Krivokon, Will Rusch, Marc Pickett, Pranesh Srinivasan, Laichee Man, Kathleen Meier-Hellstern, Meredith Ringel Morris, Tulsee Doshi, Renelito Delos Santos, Toju Duke, Johnny Soraker, Ben Zevenbergen, Vinodkumar Prabhakaran, Mark Diaz, Ben Hutchinson, Kristen Olson, Alejandra Molina, Erin Hoffman-John, Josh Lee, Lora Aroyo, Ravi Rajakumar, Alena Butryna, Matthew Lamm, Viktoriya Kuzmina, Joe Fenton, Aaron Cohen, Rachel Bernstein, Ray Kurzweil, Blaise Aguera-Arcas, Claire Cui, Marian Croak, Ed Chi, Quoc Le

Figure 1 for LaMDA: Language Models for Dialog Applications
Figure 2 for LaMDA: Language Models for Dialog Applications
Figure 3 for LaMDA: Language Models for Dialog Applications
Figure 4 for LaMDA: Language Models for Dialog Applications
Viaarxiv icon