Picture for Zhong Zhang

Zhong Zhang

AgentCPM-Explore: Realizing Long-Horizon Deep Exploration for Edge-Scale Agents

Add code
Feb 06, 2026
Viaarxiv icon

AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research

Add code
Feb 06, 2026
Viaarxiv icon

AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation

Add code
Jan 13, 2026
Viaarxiv icon

MSDformer: Multi-scale Discrete Transformer For Time Series Generation

Add code
May 20, 2025
Viaarxiv icon

ToLeaP: Rethinking Development of Tool Learning with Large Language Models

Add code
May 17, 2025
Figure 1 for ToLeaP: Rethinking Development of Tool Learning with Large Language Models
Figure 2 for ToLeaP: Rethinking Development of Tool Learning with Large Language Models
Figure 3 for ToLeaP: Rethinking Development of Tool Learning with Large Language Models
Figure 4 for ToLeaP: Rethinking Development of Tool Learning with Large Language Models
Viaarxiv icon

AC-Reason: Towards Theory-Guided Actual Causality Reasoning with Large Language Models

Add code
May 13, 2025
Viaarxiv icon

2D-Curri-DPO: Two-Dimensional Curriculum Learning for Direct Preference Optimization

Add code
Apr 10, 2025
Figure 1 for 2D-Curri-DPO: Two-Dimensional Curriculum Learning for Direct Preference Optimization
Figure 2 for 2D-Curri-DPO: Two-Dimensional Curriculum Learning for Direct Preference Optimization
Figure 3 for 2D-Curri-DPO: Two-Dimensional Curriculum Learning for Direct Preference Optimization
Figure 4 for 2D-Curri-DPO: Two-Dimensional Curriculum Learning for Direct Preference Optimization
Viaarxiv icon

Learning to Generate Structured Output with Schema Reinforcement Learning

Add code
Feb 26, 2025
Viaarxiv icon

AgentRM: Enhancing Agent Generalization with Reward Modeling

Add code
Feb 25, 2025
Viaarxiv icon

WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models

Add code
Nov 08, 2024
Viaarxiv icon