Picture for Ping-Chun Hsieh

Ping-Chun Hsieh

From Reward-Free Representations to Preferences: Rethinking Offline Preference-Based Reinforcement Learning

Add code
May 31, 2026
Viaarxiv icon

Plan2Cleanse: Test-Time Backdoor Defense via Monte-Carlo Planning in Deep Reinforcement Learning

Add code
May 10, 2026
Viaarxiv icon

A Reward-Free Viewpoint on Multi-Objective Reinforcement Learning

Add code
Apr 27, 2026
Viaarxiv icon

A Modularized Framework for Piecewise-Stationary Restless Bandits

Add code
Apr 11, 2026
Viaarxiv icon

Cross-Domain Policy Optimization via Bellman Consistency and Hybrid Critics

Add code
Mar 12, 2026
Viaarxiv icon

Semi-Supervised Cross-Domain Imitation Learning

Add code
Feb 11, 2026
Viaarxiv icon

Learning Human-Like RL Agents Through Trajectory Optimization With Action Quantization

Add code
Nov 19, 2025
Viaarxiv icon

Action-Constrained Imitation Learning

Add code
Aug 20, 2025
Figure 1 for Action-Constrained Imitation Learning
Figure 2 for Action-Constrained Imitation Learning
Figure 3 for Action-Constrained Imitation Learning
Figure 4 for Action-Constrained Imitation Learning
Viaarxiv icon

BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL

Add code
May 29, 2025
Viaarxiv icon

Efficient Action-Constrained Reinforcement Learning via Acceptance-Rejection Method and Augmented MDPs

Add code
Mar 17, 2025
Viaarxiv icon