Runji Lin

Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach

Dec 19, 2023
Weiyu Ma, Qirui Mi, Xue Yan, Yuqiao Wu, Runji Lin, Haifeng Zhang, Jun Wang

Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models

Nov 15, 2023
Keming Lu, Hongyi Yuan, Runji Lin, Junyang Lin, Zheng Yuan, Chang Zhou, Jingren Zhou

Qwen Technical Report

Sep 28, 2023
Jinze Bai, Shuai Bai, Yunfei Chu, Zeyu Cui, Kai Dang, Xiaodong Deng, Yang Fan, Wenbin Ge, Yu Han, Fei Huang, Binyuan Hui, Luo Ji, Mei Li, Junyang Lin, Runji Lin, Dayiheng Liu, Gao Liu, Chengqiang Lu, Keming Lu, Jianxin Ma, Rui Men, Xingzhang Ren, Xuancheng Ren, Chuanqi Tan, Sinan Tan, Jianhong Tu, Peng Wang, Shijie Wang, Wei Wang, Shengguang Wu, Benfeng Xu, Jin Xu, An Yang, Hao Yang, Jian Yang, Shusheng Yang, Yang Yao, Bowen Yu, Hongyi Yuan, Zheng Yuan, Jianwei Zhang, Xingxuan Zhang, Yichang Zhang, Zhenru Zhang, Chang Zhou, Jingren Zhou, Xiaohuan Zhou, Tianhang Zhu

#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models

Aug 15, 2023
Keming Lu, Hongyi Yuan, Zheng Yuan, Runji Lin, Junyang Lin, Chuanqi Tan, Chang Zhou, Jingren Zhou

Large Sequence Models for Sequential Decision-Making: A Survey

Jun 24, 2023
Muning Wen, Runji Lin, Hanjing Wang, Yaodong Yang, Ying Wen, Luo Mai, Jun Wang, Haifeng Zhang, Weinan Zhang

Contextual Transformer for Offline Meta Reinforcement Learning

Nov 15, 2022
Runji Lin, Ye Li, Xidong Feng, Zhaowei Zhang, Xian Hong Wu Fung, Haifeng Zhang, Jun Wang, Yali Du, Yaodong Yang

Fully Decentralized Model-based Policy Optimization for Networked Systems

Jul 13, 2022
Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang

Multi-Agent Reinforcement Learning is a Sequence Modeling Problem

May 30, 2022
Muning Wen, Jakub Grudzien Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, Yaodong Yang
