Alert button
Picture for Shaohan Huang

Shaohan Huang

Alert button

Retentive Network: A Successor to Transformer for Large Language Models

Add code
Bookmark button
Alert button
Jul 19, 2023
Yutao Sun, Li Dong, Shaohan Huang, Shuming Ma, Yuqing Xia, Jilong Xue, Jianyong Wang, Furu Wei

Figure 1 for Retentive Network: A Successor to Transformer for Large Language Models
Figure 2 for Retentive Network: A Successor to Transformer for Large Language Models
Figure 3 for Retentive Network: A Successor to Transformer for Large Language Models
Figure 4 for Retentive Network: A Successor to Transformer for Large Language Models
Viaarxiv icon

Kosmos-2: Grounding Multimodal Large Language Models to the World

Add code
Bookmark button
Alert button
Jul 13, 2023
Zhiliang Peng, Wenhui Wang, Li Dong, Yaru Hao, Shaohan Huang, Shuming Ma, Furu Wei

Figure 1 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Figure 2 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Figure 3 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Figure 4 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Viaarxiv icon

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Add code
Bookmark button
Alert button
Jul 05, 2023
Jiayu Ding, Shuming Ma, Li Dong, Xingxing Zhang, Shaohan Huang, Wenhui Wang, Furu Wei

Figure 1 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Figure 2 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Figure 3 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Figure 4 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Viaarxiv icon

Learning Music Sequence Representation from Text Supervision

Add code
Bookmark button
Alert button
May 31, 2023
Tianyu Chen, Yuan Xie, Shuai Zhang, Shaohan Huang, Haoyi Zhou, Jianxin Li

Viaarxiv icon

Dual-Alignment Pre-training for Cross-lingual Sentence Embedding

Add code
Bookmark button
Alert button
May 16, 2023
Ziheng Li, Shaohan Huang, Zihan Zhang, Zhi-Hong Deng, Qiang Lou, Haizhen Huang, Jian Jiao, Furu Wei, Weiwei Deng, Qi Zhang

Figure 1 for Dual-Alignment Pre-training for Cross-lingual Sentence Embedding
Figure 2 for Dual-Alignment Pre-training for Cross-lingual Sentence Embedding
Figure 3 for Dual-Alignment Pre-training for Cross-lingual Sentence Embedding
Figure 4 for Dual-Alignment Pre-training for Cross-lingual Sentence Embedding
Viaarxiv icon

Pre-training Language Model as a Multi-perspective Course Learner

Add code
Bookmark button
Alert button
May 06, 2023
Beiduo Chen, Shaohan Huang, Zihan Zhang, Wu Guo, Zhenhua Ling, Haizhen Huang, Furu Wei, Weiwei Deng, Qi Zhang

Figure 1 for Pre-training Language Model as a Multi-perspective Course Learner
Figure 2 for Pre-training Language Model as a Multi-perspective Course Learner
Figure 3 for Pre-training Language Model as a Multi-perspective Course Learner
Figure 4 for Pre-training Language Model as a Multi-perspective Course Learner
Viaarxiv icon

UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation

Add code
Bookmark button
Alert button
Mar 22, 2023
Daixuan Cheng, Shaohan Huang, Junyu Bi, Yuefeng Zhan, Jianfeng Liu, Yujing Wang, Hao Sun, Furu Wei, Denvy Deng, Qi Zhang

Figure 1 for UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation
Figure 2 for UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation
Figure 3 for UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation
Figure 4 for UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation
Viaarxiv icon

Language Is Not All You Need: Aligning Perception with Language Models

Add code
Bookmark button
Alert button
Mar 01, 2023
Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Johan Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei

Figure 1 for Language Is Not All You Need: Aligning Perception with Language Models
Figure 2 for Language Is Not All You Need: Aligning Perception with Language Models
Figure 3 for Language Is Not All You Need: Aligning Perception with Language Models
Figure 4 for Language Is Not All You Need: Aligning Perception with Language Models
Viaarxiv icon