Picture for Zhenduo Zhang

Zhenduo Zhang

Rethinking Sample Polarity in Reinforcement Learning with Verifiable Rewards

Add code
Dec 25, 2025
Viaarxiv icon

Results of the 2024 CommonRoad Motion Planning Competition for Autonomous Vehicles

Add code
Dec 22, 2025
Viaarxiv icon

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

Add code
Oct 21, 2025
Figure 1 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 2 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 3 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 4 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Viaarxiv icon

Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs

Add code
Jun 18, 2025
Viaarxiv icon

Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning

Add code
May 22, 2025
Viaarxiv icon

SHARP: Synthesizing High-quality Aligned Reasoning Problems for Large Reasoning Models Reinforcement Learning

Add code
May 21, 2025
Viaarxiv icon

KAN4TSF: Are KAN and KAN-based models Effective for Time Series Forecasting?

Add code
Aug 21, 2024
Figure 1 for KAN4TSF: Are KAN and KAN-based models Effective for Time Series Forecasting?
Figure 2 for KAN4TSF: Are KAN and KAN-based models Effective for Time Series Forecasting?
Figure 3 for KAN4TSF: Are KAN and KAN-based models Effective for Time Series Forecasting?
Figure 4 for KAN4TSF: Are KAN and KAN-based models Effective for Time Series Forecasting?
Viaarxiv icon

CoIE: Chain-of-Instruct Editing for Multi-Attribute Face Manipulation

Add code
Dec 20, 2023
Viaarxiv icon

Improved Face Representation via Joint Label Classification and Supervised Contrastive Clustering

Add code
Dec 07, 2023
Viaarxiv icon