Wanjun Zhong

Concise and Precise Context Compression for Tool-Using Language Models

Jul 02, 2024

CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models

Mar 06, 2024

PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering

Feb 26, 2024

Learning to Edit: Aligning LLMs with Knowledge Editing

Feb 19, 2024

Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios

Jan 30, 2024

YODA: Teacher-Student Progressive Learning for Language Models

Jan 28, 2024

G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

Dec 18, 2023

Data Management For Large Language Models: A Survey

Dec 04, 2023

FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models

Nov 14, 2023

A Systematic Evaluation of Large Language Models on Out-of-Distribution Logical Reasoning Tasks

Oct 18, 2023