
Wenhan Xiong

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Apr 16, 2024
Xuezhe Ma, Xiaomeng Yang, Wenhan Xiong, Beidi Chen, Lili Yu, Hao Zhang, Jonathan May, Luke Zettlemoyer, Omer Levy, Chunting Zhou

Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning

Mar 22, 2024
Rao Fu, Jingyu Liu, Xilun Chen, Yixin Nie, Wenhan Xiong

The Role of Chain-of-Thought in Complex Vision-Language Reasoning Task

Nov 15, 2023
Yifan Wu, Pengchuan Zhang, Wenhan Xiong, Barlas Oguz, James C. Gee, Yixin Nie

Sub-network Discovery and Soft-masking for Continual Learning of Mixed Tasks

Oct 13, 2023
Zixuan Ke, Bing Liu, Wenhan Xiong, Asli Celikyilmaz, Haoran Li

Effective Long-Context Scaling of Foundation Models

Sep 27, 2023
Wenhan Xiong, Jingyu Liu, Igor Molybog, Hejia Zhang, Prajjwal Bhargava, Rui Hou, Louis Martin, Rashi Rungta, Karthik Abinav Sankararaman, Barlas Oguz, Madian Khabsa, Han Fang, Yashar Mehdad, Sharan Narang, Kshitiz Malik, Angela Fan, Shruti Bhosale, Sergey Edunov, Mike Lewis, Sinong Wang, Hao Ma

LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models

Sep 07, 2023
Chi Han, Qifan Wang, Wenhan Xiong, Yu Chen, Heng Ji, Sinong Wang

Code Llama: Open Foundation Models for Code

Aug 25, 2023
Baptiste Rozière, Jonas Gehring, Fabian Gloeckle, Sten Sootla, Itai Gat, Xiaoqing Ellen Tan, Yossi Adi, Jingyu Liu, Tal Remez, Jérémy Rapin, Artyom Kozhevnikov, Ivan Evtimov, Joanna Bitton, Manish Bhatt, Cristian Canton Ferrer, Aaron Grattafiori, Wenhan Xiong, Alexandre Défossez, Jade Copet, Faisal Azhar, Hugo Touvron, Louis Martin, Nicolas Usunier, Thomas Scialom, Gabriel Synnaeve
