Furu Wei

MoEC: Mixture of Expert Clusters

Jul 19, 2022
Yuan Xie, Shaohan Huang, Tianyu Chen, Furu Wei

HLT-MT: High-resource Language-specific Training for Multilingual Neural Machine Translation

Jul 15, 2022
Jian Yang, Yuwei Yin, Shuming Ma, Dongdong Zhang, Zhoujun Li, Furu Wei

UM4: Unified Multilingual Multiple Teacher-Student Model for Zero-Resource Neural Machine Translation

Jul 11, 2022
Jian Yang, Yuwei Yin, Shuming Ma, Dongdong Zhang, Shuangzhi Wu, Hongcheng Guo, Zhoujun Li, Furu Wei

SimLM: Pre-training with Representation Bottleneck for Dense Passage Retrieval

Jul 06, 2022
Liang Wang, Nan Yang, Xiaolong Huang, Binxing Jiao, Linjun Yang, Daxin Jiang, Rangan Majumder, Furu Wei

Supervision-Guided Codebooks for Masked Prediction in Speech Pre-training

Jun 21, 2022
Chengyi Wang, Yiming Wang, Yu Wu, Sanyuan Chen, Jinyu Li, Shujie Liu, Furu Wei

The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task

Jun 14, 2022
Ziqiang Zhang, Junyi Ao, Long Zhou, Shujie Liu, Furu Wei, Jinyu Li

Language Models are General-Purpose Interfaces

Jun 13, 2022
Yaru Hao, Haoyu Song, Li Dong, Shaohan Huang, Zewen Chi, Wenhui Wang, Shuming Ma, Furu Wei

VL-BEiT: Generative Vision-Language Pretraining

Jun 02, 2022
Hangbo Bao, Wenhui Wang, Li Dong, Furu Wei

Task-Specific Expert Pruning for Sparse Mixture-of-Experts

Jun 02, 2022
Tianyu Chen, Shaohan Huang, Yuan Xie, Binxing Jiao, Daxin Jiang, Haoyi Zhou, Jianxin Li, Furu Wei

THE-X: Privacy-Preserving Transformer Inference with Homomorphic Encryption

Jun 02, 2022
Tianyu Chen, Hangbo Bao, Shaohan Huang, Li Dong, Binxing Jiao, Daxin Jiang, Haoyi Zhou, Jianxin Li, Furu Wei
