Yuchen Xiao

Offline Multi-agent Continual Cooperation via Skill Partition and Reuse

Jun 24, 2026
Via arXiv

Continual Quadruped Robots Coordination via Semantic Skill Discovery

Jun 06, 2026
Via arXiv

OmniXtreme: Breaking the Generality Barrier in High-Dynamic Humanoid Control

Feb 27, 2026
Via arXiv

Self-motion as a structural prior for coherent and robust formation of cognitive maps

Dec 23, 2025
Via arXiv

SegEarth-R2: Towards Comprehensive Language-guided Segmentation for Remote Sensing Images

Dec 23, 2025
Via arXiv

A Recurrent Spiking Network with Hierarchical Intrinsic Excitability Modulation for Schema Learning

Jan 24, 2025
Via arXiv

On Centralized Critics in Multi-Agent Reinforcement Learning

Aug 26, 2024
Via arXiv

O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models

Oct 22, 2023
Via arXiv

On-Robot Bayesian Reinforcement Learning for POMDPs

Jul 22, 2023
Via arXiv

Sequential Fair Resource Allocation under a Markov Decision Process Framework

Jan 10, 2023
Via arXiv