Alert button
Picture for Shaohan Huang

Shaohan Huang

Alert button

BitNet: Scaling 1-bit Transformers for Large Language Models

Add code
Bookmark button
Alert button
Oct 17, 2023
Hongyu Wang, Shuming Ma, Li Dong, Shaohan Huang, Huaijie Wang, Lingxiao Ma, Fan Yang, Ruiping Wang, Yi Wu, Furu Wei

Viaarxiv icon

Kosmos-G: Generating Images in Context with Multimodal Large Language Models

Add code
Bookmark button
Alert button
Oct 04, 2023
Xichen Pan, Li Dong, Shaohan Huang, Zhiliang Peng, Wenhu Chen, Furu Wei

Figure 1 for Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Figure 2 for Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Figure 3 for Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Figure 4 for Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Viaarxiv icon

Calibrating LLM-Based Evaluator

Add code
Bookmark button
Alert button
Sep 23, 2023
Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

Viaarxiv icon

Kosmos-2.5: A Multimodal Literate Model

Add code
Bookmark button
Alert button
Sep 20, 2023
Tengchao Lv, Yupan Huang, Jingye Chen, Lei Cui, Shuming Ma, Yaoyao Chang, Shaohan Huang, Wenhui Wang, Li Dong, Weiyao Luo, Shaoxiang Wu, Guoxin Wang, Cha Zhang, Furu Wei

Figure 1 for Kosmos-2.5: A Multimodal Literate Model
Figure 2 for Kosmos-2.5: A Multimodal Literate Model
Figure 3 for Kosmos-2.5: A Multimodal Literate Model
Figure 4 for Kosmos-2.5: A Multimodal Literate Model
Viaarxiv icon

Adapting Large Language Models via Reading Comprehension

Add code
Bookmark button
Alert button
Sep 18, 2023
Daixuan Cheng, Shaohan Huang, Furu Wei

Viaarxiv icon

LogGPT: Exploring ChatGPT for Log-Based Anomaly Detection

Add code
Bookmark button
Alert button
Sep 03, 2023
Jiaxing Qi, Shaohan Huang, Zhongzhi Luan, Carol Fung, Hailong Yang, Depei Qian

Figure 1 for LogGPT: Exploring ChatGPT for Log-Based Anomaly Detection
Figure 2 for LogGPT: Exploring ChatGPT for Log-Based Anomaly Detection
Figure 3 for LogGPT: Exploring ChatGPT for Log-Based Anomaly Detection
Figure 4 for LogGPT: Exploring ChatGPT for Log-Based Anomaly Detection
Viaarxiv icon

Retentive Network: A Successor to Transformer for Large Language Models

Add code
Bookmark button
Alert button
Aug 09, 2023
Yutao Sun, Li Dong, Shaohan Huang, Shuming Ma, Yuqing Xia, Jilong Xue, Jianyong Wang, Furu Wei

Figure 1 for Retentive Network: A Successor to Transformer for Large Language Models
Figure 2 for Retentive Network: A Successor to Transformer for Large Language Models
Figure 3 for Retentive Network: A Successor to Transformer for Large Language Models
Figure 4 for Retentive Network: A Successor to Transformer for Large Language Models
Viaarxiv icon

Scaling Sentence Embeddings with Large Language Models

Add code
Bookmark button
Alert button
Jul 31, 2023
Ting Jiang, Shaohan Huang, Zhongzhi Luan, Deqing Wang, Fuzhen Zhuang

Figure 1 for Scaling Sentence Embeddings with Large Language Models
Figure 2 for Scaling Sentence Embeddings with Large Language Models
Figure 3 for Scaling Sentence Embeddings with Large Language Models
Figure 4 for Scaling Sentence Embeddings with Large Language Models
Viaarxiv icon

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Add code
Bookmark button
Alert button
Jul 19, 2023
Jiayu Ding, Shuming Ma, Li Dong, Xingxing Zhang, Shaohan Huang, Wenhui Wang, Nanning Zheng, Furu Wei

Figure 1 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Figure 2 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Figure 3 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Figure 4 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Viaarxiv icon