Picture for Ruipeng Zhang

Ruipeng Zhang

Cooperative Medianet Innovation Center, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, China

Sensitivity Shaping for Latent Modeling

Add code
Jun 12, 2026
Viaarxiv icon

P$^2$-DPO: Grounding Hallucination in Perceptual Processing via Calibration Direct Preference Optimization

Add code
Jun 03, 2026
Viaarxiv icon

Eliciting Medical Reasoning with Knowledge-enhanced Data Synthesis: A Semi-Supervised Reinforcement Learning Approach

Add code
Apr 13, 2026
Viaarxiv icon

When Maximum Entropy Misleads Policy Optimization

Add code
Jun 05, 2025
Figure 1 for When Maximum Entropy Misleads Policy Optimization
Figure 2 for When Maximum Entropy Misleads Policy Optimization
Figure 3 for When Maximum Entropy Misleads Policy Optimization
Figure 4 for When Maximum Entropy Misleads Policy Optimization
Viaarxiv icon

Improving Value Estimation Critically Enhances Vanilla Policy Gradient

Add code
May 25, 2025
Figure 1 for Improving Value Estimation Critically Enhances Vanilla Policy Gradient
Figure 2 for Improving Value Estimation Critically Enhances Vanilla Policy Gradient
Figure 3 for Improving Value Estimation Critically Enhances Vanilla Policy Gradient
Figure 4 for Improving Value Estimation Critically Enhances Vanilla Policy Gradient
Viaarxiv icon

Task-unaware Lifelong Robot Learning with Retrieval-based Weighted Local Adaptation

Add code
Oct 03, 2024
Figure 1 for Task-unaware Lifelong Robot Learning with Retrieval-based Weighted Local Adaptation
Figure 2 for Task-unaware Lifelong Robot Learning with Retrieval-based Weighted Local Adaptation
Figure 3 for Task-unaware Lifelong Robot Learning with Retrieval-based Weighted Local Adaptation
Figure 4 for Task-unaware Lifelong Robot Learning with Retrieval-based Weighted Local Adaptation
Viaarxiv icon

Federated Learning with Bilateral Curation for Partially Class-Disjoint Data

Add code
May 29, 2024
Viaarxiv icon

Federated Learning under Partially Class-Disjoint Data via Manifold Reshaping

Add code
May 29, 2024
Viaarxiv icon

Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts

Add code
May 29, 2024
Figure 1 for Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts
Figure 2 for Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts
Figure 3 for Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts
Figure 4 for Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts
Viaarxiv icon

Fair Evaluation of Federated Learning Algorithms for Automated Breast Density Classification: The Results of the 2022 ACR-NCI-NVIDIA Federated Learning Challenge

Add code
May 22, 2024
Viaarxiv icon