Picture for Hangyu Mao

Hangyu Mao

GARDO: Reinforcing Diffusion Models without Reward Hacking

Add code
Dec 30, 2025
Viaarxiv icon

Kling-Omni Technical Report

Add code
Dec 18, 2025
Viaarxiv icon

GPG: Generalized Policy Gradient Theorem for Transformer-based Policies

Add code
Dec 11, 2025
Viaarxiv icon

Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

Add code
May 22, 2025
Viaarxiv icon

NGENT: Next-Generation AI Agents Must Integrate Multi-Domain Abilities to Achieve Artificial General Intelligence

Add code
Apr 30, 2025
Viaarxiv icon

DualRAG: A Dual-Process Approach to Integrate Reasoning and Retrieval for Multi-Hop Question Answering

Add code
Apr 25, 2025
Viaarxiv icon

From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models

Add code
Mar 20, 2025
Figure 1 for From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models
Figure 2 for From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models
Figure 3 for From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models
Figure 4 for From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models
Viaarxiv icon

DMQR-RAG: Diverse Multi-Query Rewriting for RAG

Add code
Nov 20, 2024
Viaarxiv icon

SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks

Add code
Nov 19, 2024
Figure 1 for SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks
Figure 2 for SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks
Figure 3 for SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks
Figure 4 for SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks
Viaarxiv icon

Mimicking Human Intuition: Cognitive Belief-Driven Q-Learning

Add code
Oct 02, 2024
Viaarxiv icon