Zhicheng Dou

ProRAG: Process-Supervised Reinforcement Learning for Retrieval-Augmented Generation

Jan 29, 2026, via arXiv

Agentic-R: Learning to Retrieve for Agentic Search

Jan 17, 2026, via arXiv

ET-Agent: Incentivizing Effective Tool-Integrated Reasoning Agent via Behavior Calibration

Jan 11, 2026, via arXiv

EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

Jan 09, 2026, via arXiv

Memory Matters More: Event-Centric Memory as a Logic Map for Agent Searching and Reasoning

Jan 08, 2026, via arXiv

SmartSearch: Process Reward-Guided Query Refinement for Search Agents

Jan 08, 2026, via arXiv

e5-omni: Explicit Cross-modal Alignment for Omni-modal Embeddings

Jan 07, 2026, via arXiv

Laser: Governing Long-Horizon Agentic Search via Structured Protocol and Context Register

Dec 23, 2025, via arXiv

Memory in the Age of AI Agents

Dec 15, 2025, via arXiv

GPG: Generalized Policy Gradient Theorem for Transformer-based Policies

Dec 11, 2025, via arXiv