Picture for Min Zhang

Min Zhang

Harbin Institute of Technology Shenzhen, Shenzhen, China

MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems

Add code
May 07, 2026
Viaarxiv icon

DMGD: Train-Free Dataset Distillation with Semantic-Distribution Matching in Diffusion Models

Add code
May 05, 2026
Viaarxiv icon

Less Languages, Less Tokens: An Efficient Unified Logic Cross-lingual Chain-of-Thought Reasoning Framework

Add code
Apr 22, 2026
Viaarxiv icon

Taming Actor-Observer Asymmetry in Agents via Dialectical Alignment

Add code
Apr 21, 2026
Viaarxiv icon

Mitigating Multimodal Hallucination via Phase-wise Self-reward

Add code
Apr 20, 2026
Viaarxiv icon

OGER: A Robust Offline-Guided Exploration Reward for Hybrid Reinforcement Learning

Add code
Apr 20, 2026
Viaarxiv icon

ToolOmni: Enabling Open-World Tool Use via Agentic learning with Proactive Retrieval and Grounded Execution

Add code
Apr 15, 2026
Viaarxiv icon

Empowering Video Translation using Multimodal Large Language Models

Add code
Apr 13, 2026
Viaarxiv icon

MathAgent: Adversarial Evolution of Constraint Graphs for Mathematical Reasoning Data Synthesis

Add code
Apr 13, 2026
Viaarxiv icon

E3-TIR: Enhanced Experience Exploitation for Tool-Integrated Reasoning

Add code
Apr 10, 2026
Viaarxiv icon