Picture for Weinan Zhang

Weinan Zhang

Large Language Models are Demonstration Pre-Selectors for Themselves

Add code
Jun 06, 2025
Viaarxiv icon

GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning

Add code
May 24, 2025
Viaarxiv icon

The Real Barrier to LLM Agent Usability is Agentic ROI

Add code
May 23, 2025
Viaarxiv icon

Superplatforms Have to Attack AI Agents

Add code
May 23, 2025
Viaarxiv icon

InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation

Add code
May 21, 2025
Viaarxiv icon

NL-Debugging: Exploiting Natural Language as an Intermediate Representation for Code Debugging

Add code
May 21, 2025
Viaarxiv icon

Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese

Add code
May 16, 2025
Viaarxiv icon

MARFT: Multi-Agent Reinforcement Fine-Tuning

Add code
Apr 24, 2025
Viaarxiv icon

A Survey of AI Agent Protocols

Add code
Apr 23, 2025
Viaarxiv icon

Information-Theoretic Reward Decomposition for Generalizable RLHF

Add code
Apr 08, 2025
Viaarxiv icon