Picture for Huajun Chen

Huajun Chen

Zhejiang University

LightMem: Lightweight and Efficient Memory-Augmented Generation

Add code
Oct 21, 2025
Viaarxiv icon

OceanGym: A Benchmark Environment for Underwater Embodied Agents

Add code
Sep 30, 2025
Viaarxiv icon

RTQA : Recursive Thinking for Complex Temporal Knowledge Graph Question Answering with Large Language Models

Add code
Sep 04, 2025
Viaarxiv icon

Memp: Exploring Agent Procedural Memory

Add code
Aug 08, 2025
Viaarxiv icon

SciToolAgent: A Knowledge Graph-Driven Scientific Agent for Multi-Tool Integration

Add code
Jul 27, 2025
Viaarxiv icon

SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs

Add code
Jul 23, 2025
Figure 1 for SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs
Figure 2 for SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs
Figure 3 for SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs
Figure 4 for SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs
Viaarxiv icon

KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality

Add code
Jun 24, 2025
Figure 1 for KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
Figure 2 for KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
Figure 3 for KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
Figure 4 for KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
Viaarxiv icon

Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study

Add code
Jun 24, 2025
Viaarxiv icon

OneEval: Benchmarking LLM Knowledge-intensive Reasoning over Diverse Knowledge Bases

Add code
Jun 14, 2025
Figure 1 for OneEval: Benchmarking LLM Knowledge-intensive Reasoning over Diverse Knowledge Bases
Figure 2 for OneEval: Benchmarking LLM Knowledge-intensive Reasoning over Diverse Knowledge Bases
Figure 3 for OneEval: Benchmarking LLM Knowledge-intensive Reasoning over Diverse Knowledge Bases
Figure 4 for OneEval: Benchmarking LLM Knowledge-intensive Reasoning over Diverse Knowledge Bases
Viaarxiv icon

AutoMind: Adaptive Knowledgeable Agent for Automated Data Science

Add code
Jun 12, 2025
Viaarxiv icon