Xiaozhe Ren

A Survey of Reasoning with Foundation Models

Dec 26, 2023
Jiankai Sun, Chuanyang Zheng, Enze Xie, Zhengying Liu, Ruihang Chu, Jianing Qiu, Jiaqi Xu, Mingyu Ding, Hongyang Li, Mengzhe Geng, Yue Wu, Wenhai Wang, Junsong Chen, Zhangyue Yin, Xiaozhe Ren, Jie Fu, Junxian He, Wu Yuan, Qi Liu, Xihui Liu, Yu Li, Hao Dong, Yu Cheng, Ming Zhang, Pheng Ann Heng, Jifeng Dai, Ping Luo, Jingdong Wang, Ji-Rong Wen, Xipeng Qiu, Yike Guo, Hui Xiong, Qun Liu, Zhenguo Li

EdgeFM: Leveraging Foundation Model for Open-set Learning on the Edge

Nov 23, 2023
Bufang Yang, Lixing He, Neiwen Ling, Zhenyu Yan, Guoliang Xing, Xian Shuai, Xiaozhe Ren, Xin Jiang

CAME: Confidence-guided Adaptive Memory Efficient Optimization

Jul 05, 2023
Yang Luo, Xiaozhe Ren, Zangwei Zheng, Zhuo Jiang, Xin Jiang, Yang You

Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline

May 22, 2023
Zangwei Zheng, Xiaozhe Ren, Fuzhao Xue, Yang Luo, Xin Jiang, Yang You

PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing

Mar 20, 2023
Xiaozhe Ren, Pingyi Zhou, Xinfan Meng, Xinjing Huang, Yadao Wang, Weichao Wang, Pengfei Li, Xiaoda Zhang, Alexander Podolskiy, Grigory Arshinov, Andrey Bout, Irina Piontkovskaya, Jiansheng Wei, Xin Jiang, Teng Su, Qun Liu, Jun Yao

Deeper vs Wider: A Revisit of Transformer Configuration

May 24, 2022
Fuzhao Xue, Jianghai Chen, Aixin Sun, Xiaozhe Ren, Zangwei Zheng, Xiaoxin He, Xin Jiang, Yang You

One Student Knows All Experts Know: From Sparse to Dense

Jan 26, 2022
Fuzhao Xue, Xiaoxin He, Xiaozhe Ren, Yuxuan Lou, Yang You

Large-Scale Deep Learning Optimizations: A Comprehensive Survey

Nov 02, 2021
Xiaoxin He, Fuzhao Xue, Xiaozhe Ren, Yang You

EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation

Sep 16, 2021
Chenhe Dong, Guangrun Wang, Hang Xu, Jiefeng Peng, Xiaozhe Ren, Xiaodan Liang
