Picture for Chaojun Xiao

Chaojun Xiao

MiniCPM4: Ultra-Efficient LLMs on End Devices

Add code
Jun 09, 2025
Viaarxiv icon

Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data

Add code
May 08, 2025
Viaarxiv icon

APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs

Add code
Feb 17, 2025
Viaarxiv icon

Densing Law of LLMs

Add code
Dec 05, 2024
Figure 1 for Densing Law of LLMs
Figure 2 for Densing Law of LLMs
Figure 3 for Densing Law of LLMs
Figure 4 for Densing Law of LLMs
Viaarxiv icon

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

Add code
Nov 04, 2024
Figure 1 for Sparsing Law: Towards Large Language Models with Greater Activation Sparsity
Figure 2 for Sparsing Law: Towards Large Language Models with Greater Activation Sparsity
Figure 3 for Sparsing Law: Towards Large Language Models with Greater Activation Sparsity
Figure 4 for Sparsing Law: Towards Large Language Models with Greater Activation Sparsity
Viaarxiv icon

Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs

Add code
Oct 09, 2024
Figure 1 for Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs
Figure 2 for Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs
Figure 3 for Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs
Figure 4 for Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs
Viaarxiv icon

Exploring the Benefit of Activation Sparsity in Pre-training

Add code
Oct 04, 2024
Viaarxiv icon

Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads

Add code
Oct 02, 2024
Figure 1 for Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads
Figure 2 for Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads
Figure 3 for Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads
Figure 4 for Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads
Viaarxiv icon

From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents

Add code
Sep 05, 2024
Figure 1 for From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents
Figure 2 for From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents
Figure 3 for From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents
Figure 4 for From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents
Viaarxiv icon

Configurable Foundation Models: Building LLMs from a Modular Perspective

Add code
Sep 04, 2024
Figure 1 for Configurable Foundation Models: Building LLMs from a Modular Perspective
Figure 2 for Configurable Foundation Models: Building LLMs from a Modular Perspective
Figure 3 for Configurable Foundation Models: Building LLMs from a Modular Perspective
Figure 4 for Configurable Foundation Models: Building LLMs from a Modular Perspective
Viaarxiv icon