Picture for Hao Sun

Hao Sun

MARS: Optimizing Dual-System Deep Research via Multi-Agent Reinforcement Learning

Add code
Oct 06, 2025
Viaarxiv icon

RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation

Add code
Sep 19, 2025
Viaarxiv icon

Flow Matching-Based Active Learning for Radio Map Construction with Low-Altitude UAVs

Add code
Sep 17, 2025
Viaarxiv icon

FlowDrive: Energy Flow Field for End-to-End Autonomous Driving

Add code
Sep 17, 2025
Viaarxiv icon

Secure Tug-of-War (SecTOW): Iterative Defense-Attack Training with Reinforcement Learning for Multimodal Model Security

Add code
Jul 29, 2025
Viaarxiv icon

Test-Time-Matching: Decouple Personality, Memory, and Linguistic Style in LLM-based Role-Playing Language Agent

Add code
Jul 23, 2025
Figure 1 for Test-Time-Matching: Decouple Personality, Memory, and Linguistic Style in LLM-based Role-Playing Language Agent
Figure 2 for Test-Time-Matching: Decouple Personality, Memory, and Linguistic Style in LLM-based Role-Playing Language Agent
Figure 3 for Test-Time-Matching: Decouple Personality, Memory, and Linguistic Style in LLM-based Role-Playing Language Agent
Figure 4 for Test-Time-Matching: Decouple Personality, Memory, and Linguistic Style in LLM-based Role-Playing Language Agent
Viaarxiv icon

Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities

Add code
Jul 17, 2025
Viaarxiv icon

EPIC: Efficient Prompt Interaction for Text-Image Classification

Add code
Jul 10, 2025
Viaarxiv icon

DynamicBench: Evaluating Real-Time Report Generation in Large Language Models

Add code
Jun 26, 2025
Viaarxiv icon

SlotPi: Physics-informed Object-centric Reasoning Models

Add code
Jun 12, 2025
Viaarxiv icon