
Wanjun Zhong

CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models

Mar 06, 2024
Zexuan Qiu, Jingjing Li, Shijue Huang, Wanjun Zhong, Irwin King

PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering

Feb 26, 2024
Yiming Du, Hongru Wang, Zhengyi Zhao, Bin Liang, Baojun Wang, Wanjun Zhong, Zezhong Wang, Kam-Fai Wong

Learning to Edit: Aligning LLMs with Knowledge Editing

Feb 19, 2024
Yuxin Jiang, Yufei Wang, Chuhan Wu, Wanjun Zhong, Xingshan Zeng, Jiahui Gao, Liangyou Li, Xin Jiang, Lifeng Shang, Ruiming Tang, Qun Liu, Wei Wang

Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios

Jan 30, 2024
Shijue Huang, Wanjun Zhong, Jianqiao Lu, Qi Zhu, Jiahui Gao, Weiwen Liu, Yutai Hou, Xingshan Zeng, Yasheng Wang, Lifeng Shang, Xin Jiang, Ruifeng Xu, Qun Liu

YODA: Teacher-Student Progressive Learning for Language Models

Jan 28, 2024
Jianqiao Lu, Wanjun Zhong, Yufei Wang, Zhijiang Guo, Qi Zhu, Wenyong Huang, Yanlin Wang, Fei Mi, Baojun Wang, Yasheng Wang, Lifeng Shang, Xin Jiang, Qun Liu

G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

Dec 18, 2023
Jiahui Gao, Renjie Pi, Jipeng Zhang, Jiacheng Ye, Wanjun Zhong, Yufei Wang, Lanqing Hong, Jianhua Han, Hang Xu, Zhenguo Li, Lingpeng Kong

Data Management For Large Language Models: A Survey

Dec 04, 2023
Zige Wang, Wanjun Zhong, Yufei Wang, Qi Zhu, Fei Mi, Baojun Wang, Lifeng Shang, Xin Jiang, Qun Liu

FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models

Nov 14, 2023
Yuxin Jiang, Yufei Wang, Xingshan Zeng, Wanjun Zhong, Liangyou Li, Fei Mi, Lifeng Shang, Xin Jiang, Qun Liu, Wei Wang

A Systematic Evaluation of Large Language Models on Out-of-Distribution Logical Reasoning Tasks

Oct 18, 2023
Qiming Bao, Gael Gendron, Alex Yuxuan Peng, Wanjun Zhong, Neset Tan, Yang Chen, Michael Witbrock, Jiamou Liu
