Picture for Jiajun Zhang

Jiajun Zhang

National Laboratory of Pattern Recognition, Institute of Automation, CAS, Beijing, China, School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China

KTAE: A Model-Free Algorithm to Key-Tokens Advantage Estimation in Mathematical Reasoning

Add code
May 22, 2025
Viaarxiv icon

LENS: Multi-level Evaluation of Multimodal Reasoning with Large Language Models

Add code
May 21, 2025
Viaarxiv icon

Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning

Add code
May 18, 2025
Viaarxiv icon

Towards Scientific Intelligence: A Survey of LLM-based Scientific Agents

Add code
Mar 31, 2025
Viaarxiv icon

Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment

Add code
Mar 06, 2025
Viaarxiv icon

LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs

Add code
Mar 04, 2025
Figure 1 for LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs
Figure 2 for LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs
Figure 3 for LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs
Figure 4 for LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs
Viaarxiv icon

An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning

Add code
Mar 04, 2025
Viaarxiv icon

LR${}^{2}$Bench: Evaluating Long-chain Reflective Reasoning Capabilities of Large Language Models via Constraint Satisfaction Problems

Add code
Feb 25, 2025
Viaarxiv icon

EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models

Add code
Feb 10, 2025
Viaarxiv icon

Aligning Instruction Tuning with Pre-training

Add code
Jan 16, 2025
Figure 1 for Aligning Instruction Tuning with Pre-training
Figure 2 for Aligning Instruction Tuning with Pre-training
Figure 3 for Aligning Instruction Tuning with Pre-training
Figure 4 for Aligning Instruction Tuning with Pre-training
Viaarxiv icon