Zhanhong Jiang

LexiSafe: Offline Safe Reinforcement Learning with Lexicographic Safety-Reward Hierarchy

Feb 19, 2026
Via arXiv

Balancing Utility and Privacy: Dynamically Private SGD with Random Projection

Sep 11, 2025
Via arXiv

Data-driven Kinematic Modeling in Soft Robots: System Identification and Uncertainty Quantification

Jul 10, 2025
Via arXiv

DeCAF: Decentralized Consensus-And-Factorization for Low-Rank Adaptation of Foundation Models

May 27, 2025
Via arXiv

Bidirectional Linear Recurrent Models for Sequence-Level Multisource Fusion

Apr 11, 2025
Via arXiv

FUSE: First-Order and Second-Order Unified SynthEsis in Stochastic Optimization

Mar 06, 2025
Via arXiv

Enhancing PPO with Trajectory-Aware Hybrid Policies

Feb 21, 2025
Via arXiv

RLS3: RL-Based Synthetic Sample Selection to Enhance Spatial Reasoning in Vision-Language Models for Indoor Autonomous Perception

Jan 31, 2025
Via arXiv

STITCH: Surface reconstrucTion using Implicit neural representations with Topology Constraints and persistent Homology

Dec 24, 2024
Via arXiv

FAWAC: Feasibility Informed Advantage Weighted Regression for Persistent Safety in Offline Reinforcement Learning

Dec 12, 2024
Via arXiv