Ping-Chun Hsieh

A Reward-Free Viewpoint on Multi-Objective Reinforcement Learning

Apr 27, 2026
Via arXiv

A Modularized Framework for Piecewise-Stationary Restless Bandits

Apr 11, 2026
Via arXiv

Cross-Domain Policy Optimization via Bellman Consistency and Hybrid Critics

Mar 12, 2026
Via arXiv

Semi-Supervised Cross-Domain Imitation Learning

Feb 11, 2026
Via arXiv

Learning Human-Like RL Agents Through Trajectory Optimization With Action Quantization

Nov 19, 2025
Via arXiv

Action-Constrained Imitation Learning

Aug 20, 2025
Via arXiv

BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL

May 29, 2025
Via arXiv

Efficient Action-Constrained Reinforcement Learning via Acceptance-Rejection Method and Augmented MDPs

Mar 17, 2025
Via arXiv

Imitation Learning of Correlated Policies in Stackelberg Games

Mar 11, 2025
Via arXiv

Plan2Align: Predictive Planning Based Test-Time Preference Alignment in Paragraph-Level Machine Translation

Feb 28, 2025
Via arXiv