Alert button
Picture for Furu Wei

Furu Wei

Alert button

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Add code
Bookmark button
Alert button
Jul 05, 2023
Jiayu Ding, Shuming Ma, Li Dong, Xingxing Zhang, Shaohan Huang, Wenhui Wang, Furu Wei

Figure 1 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Figure 2 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Figure 3 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Figure 4 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Viaarxiv icon

Kosmos-2: Grounding Multimodal Large Language Models to the World

Add code
Bookmark button
Alert button
Jun 27, 2023
Zhiliang Peng, Wenhui Wang, Li Dong, Yaru Hao, Shaohan Huang, Shuming Ma, Furu Wei

Figure 1 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Figure 2 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Figure 3 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Figure 4 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Viaarxiv icon

Learning to Rank in Generative Retrieval

Add code
Bookmark button
Alert button
Jun 27, 2023
Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li

Figure 1 for Learning to Rank in Generative Retrieval
Figure 2 for Learning to Rank in Generative Retrieval
Figure 3 for Learning to Rank in Generative Retrieval
Figure 4 for Learning to Rank in Generative Retrieval
Viaarxiv icon

Knowledge Distillation of Large Language Models

Add code
Bookmark button
Alert button
Jun 14, 2023
Yuxian Gu, Li Dong, Furu Wei, Minlie Huang

Figure 1 for Knowledge Distillation of Large Language Models
Figure 2 for Knowledge Distillation of Large Language Models
Figure 3 for Knowledge Distillation of Large Language Models
Figure 4 for Knowledge Distillation of Large Language Models
Viaarxiv icon

Augmenting Language Models with Long-Term Memory

Add code
Bookmark button
Alert button
Jun 12, 2023
Weizhi Wang, Li Dong, Hao Cheng, Xiaodong Liu, Xifeng Yan, Jianfeng Gao, Furu Wei

Figure 1 for Augmenting Language Models with Long-Term Memory
Figure 2 for Augmenting Language Models with Long-Term Memory
Figure 3 for Augmenting Language Models with Long-Term Memory
Figure 4 for Augmenting Language Models with Long-Term Memory
Viaarxiv icon

Multiview Identifiers Enhanced Generative Retrieval

Add code
Bookmark button
Alert button
May 26, 2023
Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li

Figure 1 for Multiview Identifiers Enhanced Generative Retrieval
Figure 2 for Multiview Identifiers Enhanced Generative Retrieval
Figure 3 for Multiview Identifiers Enhanced Generative Retrieval
Figure 4 for Multiview Identifiers Enhanced Generative Retrieval
Viaarxiv icon

VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation

Add code
Bookmark button
Alert button
May 25, 2023
Tianrui Wang, Long Zhou, Ziqiang Zhang, Yu Wu, Shujie Liu, Yashesh Gaur, Zhuo Chen, Jinyu Li, Furu Wei

Figure 1 for VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation
Figure 2 for VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation
Figure 3 for VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation
Figure 4 for VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation
Viaarxiv icon

TextDiffuser: Diffusion Models as Text Painters

Add code
Bookmark button
Alert button
May 24, 2023
Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei

Figure 1 for TextDiffuser: Diffusion Models as Text Painters
Figure 2 for TextDiffuser: Diffusion Models as Text Painters
Figure 3 for TextDiffuser: Diffusion Models as Text Painters
Figure 4 for TextDiffuser: Diffusion Models as Text Painters
Viaarxiv icon

Not All Metrics Are Guilty: Improving NLG Evaluation with LLM Paraphrasing

Add code
Bookmark button
Alert button
May 24, 2023
Tianyi Tang, Hongyuan Lu, Yuchen Eleanor Jiang, Haoyang Huang, Dongdong Zhang, Wayne Xin Zhao, Furu Wei

Figure 1 for Not All Metrics Are Guilty: Improving NLG Evaluation with LLM Paraphrasing
Figure 2 for Not All Metrics Are Guilty: Improving NLG Evaluation with LLM Paraphrasing
Figure 3 for Not All Metrics Are Guilty: Improving NLG Evaluation with LLM Paraphrasing
Figure 4 for Not All Metrics Are Guilty: Improving NLG Evaluation with LLM Paraphrasing
Viaarxiv icon

One-stop Training of Multiple Capacity Models

Add code
Bookmark button
Alert button
May 24, 2023
Lan Jiang, Haoyang Huang, Dongdong Zhang, Rui Jiang, Furu Wei

Figure 1 for One-stop Training of Multiple Capacity Models
Figure 2 for One-stop Training of Multiple Capacity Models
Figure 3 for One-stop Training of Multiple Capacity Models
Figure 4 for One-stop Training of Multiple Capacity Models
Viaarxiv icon