Ruipeng Zhang

Cooperative Medianet Innovation Center, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, China

Eliciting Medical Reasoning with Knowledge-enhanced Data Synthesis: A Semi-Supervised Reinforcement Learning Approach

Apr 13, 2026
Via arXiv

When Maximum Entropy Misleads Policy Optimization

Jun 05, 2025
Via arXiv

Improving Value Estimation Critically Enhances Vanilla Policy Gradient

May 25, 2025
Via arXiv

Task-unaware Lifelong Robot Learning with Retrieval-based Weighted Local Adaptation

Oct 03, 2024
Via arXiv

Federated Learning under Partially Class-Disjoint Data via Manifold Reshaping

May 29, 2024
Via arXiv

Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts

May 29, 2024
Via arXiv

Federated Learning with Bilateral Curation for Partially Class-Disjoint Data

May 29, 2024
Via arXiv

Fair Evaluation of Federated Learning Algorithms for Automated Breast Density Classification: The Results of the 2022 ACR-NCI-NVIDIA Federated Learning Challenge

May 22, 2024
Via arXiv

UniChest: Conquer-and-Divide Pre-training for Multi-Source Chest X-Ray Classification

Dec 18, 2023
Via arXiv

Learning-based Motion Planning in Dynamic Environments Using GNNs and Temporal Encoding

Oct 16, 2022
Via arXiv