Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zikang Yin

Agent-as-a-Service based on Agent Network

May 13, 2025

Yuhan Zhu, Haojie Liu, Jian Wang, Bing Li, Zikang Yin, Yefei Liao

Abstract:The rise of large model-based AI agents has spurred interest in Multi-Agent Systems (MAS) for their capabilities in decision-making, collaboration, and adaptability. While the Model Context Protocol (MCP) addresses tool invocation and data exchange challenges via a unified protocol, it lacks support for organizing agent-level collaboration. To bridge this gap, we propose Agent-as-a-Service based on Agent Network (AaaS-AN), a service-oriented paradigm grounded in the Role-Goal-Process-Service (RGPS) standard. AaaS-AN unifies the entire agent lifecycle, including construction, integration, interoperability, and networked collaboration, through two core components: (1) a dynamic Agent Network, which models agents and agent groups as vertexes that self-organize within the network based on task and role dependencies; (2) service-oriented agents, incorporating service discovery, registration, and interoperability protocols. These are orchestrated by a Service Scheduler, which leverages an Execution Graph to enable distributed coordination, context tracking, and runtime task management. We validate AaaS-AN on mathematical reasoning and application-level code generation tasks, which outperforms state-of-the-art baselines. Notably, we constructed a MAS based on AaaS-AN containing agent groups, Robotic Process Automation (RPA) workflows, and MCP servers over 100 agent services. We also release a dataset containing 10,000 long-horizon multi-agent workflows to facilitate future research on long-chain collaboration in MAS.

* work in progress

Via

Access Paper or Ask Questions

Non-Equilibrium MAV-Capture-MAV via Time-Optimal Planning and Reinforcement Learning

Mar 09, 2025

Canlun Zheng, Zhanyu Guo, Zikang Yin, Chunyu Wang, Zhikun Wang, Shiyu Zhao

Figure 1 for Non-Equilibrium MAV-Capture-MAV via Time-Optimal Planning and Reinforcement Learning

Figure 2 for Non-Equilibrium MAV-Capture-MAV via Time-Optimal Planning and Reinforcement Learning

Figure 3 for Non-Equilibrium MAV-Capture-MAV via Time-Optimal Planning and Reinforcement Learning

Figure 4 for Non-Equilibrium MAV-Capture-MAV via Time-Optimal Planning and Reinforcement Learning

Abstract:The capture of flying MAVs (micro aerial vehicles) has garnered increasing research attention due to its intriguing challenges and promising applications. Despite recent advancements, a key limitation of existing work is that capture strategies are often relatively simple and constrained by platform performance. This paper addresses control strategies capable of capturing high-maneuverability targets. The unique challenge of achieving target capture under unstable conditions distinguishes this task from traditional pursuit-evasion and guidance problems. In this study, we transition from larger MAV platforms to a specially designed, compact capture MAV equipped with a custom launching device while maintaining high maneuverability. We explore both time-optimal planning (TOP) and reinforcement learning (RL) methods. Simulations demonstrate that TOP offers highly maneuverable and shorter trajectories, while RL excels in real-time adaptability and stability. Moreover, the RL method has been tested in real-world scenarios, successfully achieving target capture even in unstable states.

Via

Access Paper or Ask Questions

TACO: General Acrobatic Flight Control via Target-and-Command-Oriented Reinforcement Learning

Mar 06, 2025

Zikang Yin, Canlun Zheng, Shiliang Guo, Zhikun Wang, Shiyu Zhao

Figure 1 for TACO: General Acrobatic Flight Control via Target-and-Command-Oriented Reinforcement Learning

Figure 2 for TACO: General Acrobatic Flight Control via Target-and-Command-Oriented Reinforcement Learning

Figure 3 for TACO: General Acrobatic Flight Control via Target-and-Command-Oriented Reinforcement Learning

Figure 4 for TACO: General Acrobatic Flight Control via Target-and-Command-Oriented Reinforcement Learning

Abstract:Although acrobatic flight control has been studied extensively, one key limitation of the existing methods is that they are usually restricted to specific maneuver tasks and cannot change flight pattern parameters online. In this work, we propose a target-and-command-oriented reinforcement learning (TACO) framework, which can handle different maneuver tasks in a unified way and allows online parameter changes. Additionally, we propose a spectral normalization method with input-output rescaling to enhance the policy's temporal and spatial smoothness, independence, and symmetry, thereby overcoming the sim-to-real gap. We validate the TACO approach through extensive simulation and real-world experiments, demonstrating its capability to achieve high-speed circular flights and continuous multi-flips.

* For the experiment video, please refer to https://www.youtube.com/watch?v=4tX_25BcMJw&ab_channel=WINDYLab

Via

Access Paper or Ask Questions