Ping-Chun Hsieh

Image Deraining via Self-supervised Reinforcement Learning

Mar 27, 2024

Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion

Mar 19, 2024

PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping

Dec 19, 2023

Accelerated Policy Gradient: On the Nesterov Momentum for Reinforcement Learning

Oct 18, 2023

Value-Biased Maximum Likelihood Estimation for Model-based Reinforcement Learning in Discounted Linear MDPs

Oct 17, 2023

Towards Human-Like RL: Taming Non-Naturalistic Behavior in Deep RL via Adaptive Behavioral Costs in 3D Games

Sep 27, 2023

Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees

Dec 10, 2022

Q-Pensieve: Boosting Sample Efficiency of Multi-Objective RL Through Memory Sharing of Q-Snapshots

Dec 06, 2022

Neural Frank-Wolfe Policy Optimization for Region-of-Interest Intra-Frame Coding with HEVC/H.265

Sep 27, 2022

Neural Contextual Bandits via Reward-Biased Maximum Likelihood Estimation

Mar 08, 2022