Shuming Ma

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Jul 05, 2023
Jiayu Ding, Shuming Ma, Li Dong, Xingxing Zhang, Shaohan Huang, Wenhui Wang, Furu Wei

Kosmos-2: Grounding Multimodal Large Language Models to the World

Jun 27, 2023
Zhiliang Peng, Wenhui Wang, Li Dong, Yaru Hao, Shaohan Huang, Shuming Ma, Furu Wei

Discourse Centric Evaluation of Machine Translation with a Densely Annotated Parallel Corpus

May 18, 2023
Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Mrinmaya Sachan, Ryan Cotterell

On the Off-Target Problem of Zero-Shot Multilingual Neural Machine Translation

May 18, 2023
Liang Chen, Shuming Ma, Dongdong Zhang, Furu Wei, Baobao Chang

On the Pareto Front of Multilingual Neural Machine Translation

Apr 07, 2023
Liang Chen, Shuming Ma, Dongdong Zhang, Furu Wei, Baobao Chang

Are More Layers Beneficial to Graph Transformers?

Mar 01, 2023
Haiteng Zhao, Shuming Ma, Dongdong Zhang, Zhi-Hong Deng, Furu Wei

Language Is Not All You Need: Aligning Perception with Language Models

Mar 01, 2023
Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Johan Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei

HanoiT: Enhancing Context-aware Translation via Selective Context

Jan 17, 2023
Jian Yang, Yuwei Yin, Shuming Ma, Liqun Yang, Hongcheng Guo, Haoyang Huang, Dongdong Zhang, Yutao Zeng, Zhoujun Li, Furu Wei

A Length-Extrapolatable Transformer

Dec 20, 2022
Yutao Sun, Li Dong, Barun Patra, Shuming Ma, Shaohan Huang, Alon Benhaim, Vishrav Chaudhary, Xia Song, Furu Wei

GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator

Dec 20, 2022
Jian Yang, Shuming Ma, Li Dong, Shaohan Huang, Haoyang Huang, Yuwei Yin, Dongdong Zhang, Liqun Yang, Zhoujun Li, Furu Wei
