Nan Du

Mixture-of-Experts with Expert Choice Routing

Feb 18, 2022
Yanqi Zhou, Tao Lei, Hanxiao Liu, Nan Du, Yanping Huang, Vincent Zhao, Andrew Dai, Zhifeng Chen, Quoc Le, James Laudon

Designing Effective Sparse Expert Models

Feb 17, 2022
Barret Zoph, Irwan Bello, Sameer Kumar, Nan Du, Yanping Huang, Jeff Dean, Noam Shazeer, William Fedus

GLaM: Efficient Scaling of Language Models with Mixture-of-Experts

Dec 13, 2021
Nan Du, Yanping Huang, Andrew M. Dai, Simon Tong, Dmitry Lepikhin, Yuanzhong Xu, Maxim Krikun, Yanqi Zhou, Adams Wei Yu, Orhan Firat, Barret Zoph, Liam Fedus, Maarten Bosma, Zongwei Zhou, Tao Wang, Yu Emma Wang, Kellie Webster, Marie Pellat, Kevin Robinson, Kathy Meier-Hellstern, Toju Duke, Lucas Dixon, Kun Zhang, Quoc V Le, Yonghui Wu, Zhifeng Chen, Claire Cui

Finetuned Language Models Are Zero-Shot Learners

Sep 03, 2021
Jason Wei, Maarten Bosma, Vincent Y. Zhao, Kelvin Guu, Adams Wei Yu, Brian Lester, Nan Du, Andrew M. Dai, Quoc V. Le

R2D2: Relational Text Decoding with Transformers

May 10, 2021
Aryan Arbabi, Mingqiu Wang, Laurent El Shafey, Nan Du, Izhak Shafran

The Medical Scribe: Corpus Development and Model Performance Analyses

Mar 12, 2020
Izhak Shafran, Nan Du, Linh Tran, Amanda Perry, Lauren Keyes, Mark Knichel, Ashley Domin, Lei Huang, Yuhui Chen, Gang Li, Mingqiu Wang, Laurent El Shafey, Hagen Soltau, Justin S. Paul

Deep Physiological State Space Model for Clinical Forecasting

Dec 04, 2019
Yuan Xue, Denny Zhou, Nan Du, Andrew Dai, Zhen Xu, Kun Zhang, Claire Cui

Learning to Infer Entities, Properties and their Relations from Clinical Conversations

Aug 30, 2019
Nan Du, Mingqiu Wang, Linh Tran, Gang Li, Izhak Shafran

Multi-Grained Named Entity Recognition

Jun 20, 2019
Congying Xia, Chenwei Zhang, Tao Yang, Yaliang Li, Nan Du, Xian Wu, Wei Fan, Fenglong Ma, Philip Yu

Extracting Symptoms and their Status from Clinical Conversations

Jun 05, 2019
Nan Du, Kai Chen, Anjuli Kannan, Linh Tran, Yuhui Chen, Izhak Shafran
