Picture for Fengxiang He

Fengxiang He

Efficient Counterfactual Reasoning in ProbLog via Single World Intervention Programs

Add code
Mar 20, 2026
Viaarxiv icon

Integrating LTL Constraints into PPO for Safe Reinforcement Learning

Add code
Mar 01, 2026
Viaarxiv icon

Generalisation of RLHF under Reward Shift and Clipped KL Regularisation

Add code
Feb 25, 2026
Viaarxiv icon

Near-Constant Strong Violation and Last-Iterate Convergence for Online CMDPs via Decaying Safety Margins

Add code
Feb 11, 2026
Viaarxiv icon

Rationality Measurement and Theory for Reinforcement Learning Agents

Add code
Feb 04, 2026
Viaarxiv icon

DeXposure-FM: A Time-series, Graph Foundation Model for Credit Exposures and Stability on Decentralized Financial Networks

Add code
Feb 03, 2026
Viaarxiv icon

Drawback of Enforcing Equivariance and its Compensation via the Lens of Expressive Power

Add code
Dec 10, 2025
Viaarxiv icon

The Current State of AI Bias Bounties: An Overview of Existing Programmes and Research

Add code
Oct 02, 2025
Figure 1 for The Current State of AI Bias Bounties: An Overview of Existing Programmes and Research
Figure 2 for The Current State of AI Bias Bounties: An Overview of Existing Programmes and Research
Viaarxiv icon

DICE: Data Influence Cascade in Decentralized Learning

Add code
Jul 09, 2025
Viaarxiv icon

When a Reinforcement Learning Agent Encounters Unknown Unknowns

Add code
May 19, 2025
Viaarxiv icon