Zhiqiang Pu

Enhancing Reinforcement Learning Fine-Tuning with an Online Refiner

Mar 18, 2026

Efficient Soft Actor-Critic with LLM-Based Action-Level Guidance for Continuous Control

Mar 18, 2026

MI-DETR: A Strong Baseline for Moving Infrared Small Target Detection with Bio-Inspired Motion Integration

Mar 05, 2026

Self-Compression of Chain-of-Thought via Multi-Agent Reinforcement Learning

Jan 29, 2026

Heterogeneity in Multi-Agent Reinforcement Learning

Dec 28, 2025

TacEleven: generative tactic discovery for football open play

Nov 18, 2025

CoMoE: Contrastive Representation for Mixture-of-Experts in Parameter-Efficient Fine-tuning

May 23, 2025

Unreal-MAP: Unreal-Engine-Based General Platform for Multi-Agent Reinforcement Learning

Mar 20, 2025

Stochastic Trajectory Prediction under Unstructured Constraints

Mar 18, 2025

Causal Mean Field Multi-Agent Reinforcement Learning

Feb 20, 2025