Ningyu Zhang

OceanGym: A Benchmark Environment for Underwater Embodied Agents

Sep 30, 2025
Via arXiv

Memp: Exploring Agent Procedural Memory

Aug 08, 2025
Via arXiv

Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study

Jun 24, 2025
Via arXiv

KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality

Jun 24, 2025
Via arXiv

AutoMind: Adaptive Knowledgeable Agent for Automated Data Science

Jun 12, 2025
Via arXiv

ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark

Jun 12, 2025
Via arXiv

Spatial Knowledge Graph-Guided Multimodal Synthesis

May 28, 2025
Via arXiv

UniErase: Unlearning Token as a Universal Erasure Primitive for Language Models

May 21, 2025
Via arXiv

Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training

May 20, 2025
Via arXiv

Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey

May 06, 2025
Via arXiv