Zhaopeng Meng

Qibo: A Large Language Model for Traditional Chinese Medicine

Mar 24, 2024

Learning with Noisy Labels Using Collaborative Sample Selection and Contrastive Semi-Supervised Learning

Oct 24, 2023

Ensemble-based Offline-to-Online Reinforcement Learning: From Pessimistic Learning to Optimistic Exploration

Jun 12, 2023

HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach

Jun 10, 2023

In-Sample Policy Iteration for Offline Reinforcement Learning

Jun 09, 2023

ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation

Oct 26, 2022

PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations

Apr 06, 2022

Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning

Nov 19, 2021

Exploration in Deep Reinforcement Learning: A Comprehensive Survey

Sep 15, 2021

HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation

Sep 12, 2021