Picture for Jian Peng

Jian Peng

School of Information Engineering, Jiangxi Vocational College of Finance & Economics, Jiujiang, China

Off-Policy Reinforcement Learning with Delayed Rewards

Add code
Jun 22, 2021
Figure 1 for Off-Policy Reinforcement Learning with Delayed Rewards
Figure 2 for Off-Policy Reinforcement Learning with Delayed Rewards
Figure 3 for Off-Policy Reinforcement Learning with Delayed Rewards
Figure 4 for Off-Policy Reinforcement Learning with Delayed Rewards
Viaarxiv icon

DAP: Detection-Aware Pre-training with Weak Supervision

Add code
Mar 30, 2021
Figure 1 for DAP: Detection-Aware Pre-training with Weak Supervision
Figure 2 for DAP: Detection-Aware Pre-training with Weak Supervision
Figure 3 for DAP: Detection-Aware Pre-training with Weak Supervision
Figure 4 for DAP: Detection-Aware Pre-training with Weak Supervision
Viaarxiv icon

Learning Neural Generative Dynamics for Molecular Conformation Generation

Add code
Feb 28, 2021
Figure 1 for Learning Neural Generative Dynamics for Molecular Conformation Generation
Figure 2 for Learning Neural Generative Dynamics for Molecular Conformation Generation
Figure 3 for Learning Neural Generative Dynamics for Molecular Conformation Generation
Figure 4 for Learning Neural Generative Dynamics for Molecular Conformation Generation
Viaarxiv icon

Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity

Add code
Nov 05, 2020
Figure 1 for Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity
Figure 2 for Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity
Figure 3 for Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity
Figure 4 for Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity
Viaarxiv icon

Off-Policy Interval Estimation with Lipschitz Value Iteration

Add code
Oct 29, 2020
Figure 1 for Off-Policy Interval Estimation with Lipschitz Value Iteration
Figure 2 for Off-Policy Interval Estimation with Lipschitz Value Iteration
Figure 3 for Off-Policy Interval Estimation with Lipschitz Value Iteration
Figure 4 for Off-Policy Interval Estimation with Lipschitz Value Iteration
Viaarxiv icon

Learning Guidance Rewards with Trajectory-space Smoothing

Add code
Oct 23, 2020
Figure 1 for Learning Guidance Rewards with Trajectory-space Smoothing
Figure 2 for Learning Guidance Rewards with Trajectory-space Smoothing
Figure 3 for Learning Guidance Rewards with Trajectory-space Smoothing
Figure 4 for Learning Guidance Rewards with Trajectory-space Smoothing
Viaarxiv icon

Efficient Competitive Self-Play Policy Optimization

Add code
Sep 13, 2020
Figure 1 for Efficient Competitive Self-Play Policy Optimization
Figure 2 for Efficient Competitive Self-Play Policy Optimization
Figure 3 for Efficient Competitive Self-Play Policy Optimization
Figure 4 for Efficient Competitive Self-Play Policy Optimization
Viaarxiv icon

Pre-training of Graph Neural Network for Modeling Effects of Mutations on Protein-Protein Binding Affinity

Add code
Aug 28, 2020
Figure 1 for Pre-training of Graph Neural Network for Modeling Effects of Mutations on Protein-Protein Binding Affinity
Figure 2 for Pre-training of Graph Neural Network for Modeling Effects of Mutations on Protein-Protein Binding Affinity
Figure 3 for Pre-training of Graph Neural Network for Modeling Effects of Mutations on Protein-Protein Binding Affinity
Figure 4 for Pre-training of Graph Neural Network for Modeling Effects of Mutations on Protein-Protein Binding Affinity
Viaarxiv icon

Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer

Add code
Jul 15, 2020
Figure 1 for Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer
Figure 2 for Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer
Figure 3 for Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer
Figure 4 for Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer
Viaarxiv icon

Accelerating Nonconvex Learning via Replica Exchange Langevin Diffusion

Add code
Jul 04, 2020
Figure 1 for Accelerating Nonconvex Learning via Replica Exchange Langevin Diffusion
Figure 2 for Accelerating Nonconvex Learning via Replica Exchange Langevin Diffusion
Figure 3 for Accelerating Nonconvex Learning via Replica Exchange Langevin Diffusion
Viaarxiv icon