Zuyuan Zhang

Cochain Perspectives on Temporal-Difference Signals for Learning Beyond Markov Dynamics

Feb 06, 2026
Via arXiv

Structuring Value Representations via Geometric Coherence in Markov Decision Processes

Feb 03, 2026
Via arXiv

Manifold-Constrained Energy-Based Transition Models for Offline Reinforcement Learning

Feb 02, 2026
Via arXiv

Geometry of Drifting MDPs with Path-Integral Stability Certificates

Jan 29, 2026
Via arXiv

Tail-Risk-Safe Monte Carlo Tree Search under PAC-Level Guarantees

Aug 07, 2025
Via arXiv

Second-Order Convergence in Private Stochastic Non-Convex Optimization

May 21, 2025
Via arXiv

Network Diffuser for Placing-Scheduling Service Function Chains with Inverse Demonstration

Jan 10, 2025
Via arXiv

Cooperative Backdoor Attack in Decentralized Reinforcement Learning with Theoretical Guarantee

May 24, 2024
Via arXiv

Collaborative AI Teaming in Unknown Environments via Active Goal Deduction

Mar 22, 2024
Via arXiv