Picture for Huajun Chen

Huajun Chen

Zhejiang University

SciToolAgent: A Knowledge Graph-Driven Scientific Agent for Multi-Tool Integration

Add code
Jul 27, 2025
Viaarxiv icon

SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs

Add code
Jul 23, 2025
Figure 1 for SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs
Figure 2 for SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs
Figure 3 for SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs
Figure 4 for SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs
Viaarxiv icon

Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study

Add code
Jun 24, 2025
Figure 1 for Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study
Figure 2 for Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study
Figure 3 for Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study
Figure 4 for Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study
Viaarxiv icon

KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality

Add code
Jun 24, 2025
Figure 1 for KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
Figure 2 for KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
Figure 3 for KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
Figure 4 for KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
Viaarxiv icon

OneEval: Benchmarking LLM Knowledge-intensive Reasoning over Diverse Knowledge Bases

Add code
Jun 14, 2025
Figure 1 for OneEval: Benchmarking LLM Knowledge-intensive Reasoning over Diverse Knowledge Bases
Figure 2 for OneEval: Benchmarking LLM Knowledge-intensive Reasoning over Diverse Knowledge Bases
Figure 3 for OneEval: Benchmarking LLM Knowledge-intensive Reasoning over Diverse Knowledge Bases
Figure 4 for OneEval: Benchmarking LLM Knowledge-intensive Reasoning over Diverse Knowledge Bases
Viaarxiv icon

AutoMind: Adaptive Knowledgeable Agent for Automated Data Science

Add code
Jun 12, 2025
Viaarxiv icon

Beyond Completion: A Foundation Model for General Knowledge Graph Reasoning

Add code
May 28, 2025
Viaarxiv icon

Spatial Knowledge Graph-Guided Multimodal Synthesis

Add code
May 28, 2025
Viaarxiv icon

SciCUEval: A Comprehensive Dataset for Evaluating Scientific Context Understanding in Large Language Models

Add code
May 21, 2025
Figure 1 for SciCUEval: A Comprehensive Dataset for Evaluating Scientific Context Understanding in Large Language Models
Figure 2 for SciCUEval: A Comprehensive Dataset for Evaluating Scientific Context Understanding in Large Language Models
Figure 3 for SciCUEval: A Comprehensive Dataset for Evaluating Scientific Context Understanding in Large Language Models
Figure 4 for SciCUEval: A Comprehensive Dataset for Evaluating Scientific Context Understanding in Large Language Models
Viaarxiv icon

Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey

Add code
May 06, 2025
Figure 1 for Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey
Figure 2 for Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey
Figure 3 for Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey
Figure 4 for Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey
Viaarxiv icon