Picture for Song Yu

Song Yu

EP-GRPO: Entropy-Progress Aligned Group Relative Policy Optimization with Implicit Process Guidance

Add code
May 06, 2026
Viaarxiv icon

Exploring Boundary-Aware Spatial-Frequency Fusion for Camouflaged Object Detection

Add code
Apr 20, 2026
Viaarxiv icon

ERPO: Token-Level Entropy-Regulated Policy Optimization for Large Reasoning Models

Add code
Mar 30, 2026
Viaarxiv icon

Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback

Add code
Nov 01, 2024
Figure 1 for Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback
Figure 2 for Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback
Figure 3 for Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback
Figure 4 for Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback
Viaarxiv icon

Adaptive optical signal-to-noise ratio recovery for long-distance optical fiber transmission

Add code
Aug 02, 2024
Figure 1 for Adaptive optical signal-to-noise ratio recovery for long-distance optical fiber transmission
Figure 2 for Adaptive optical signal-to-noise ratio recovery for long-distance optical fiber transmission
Figure 3 for Adaptive optical signal-to-noise ratio recovery for long-distance optical fiber transmission
Figure 4 for Adaptive optical signal-to-noise ratio recovery for long-distance optical fiber transmission
Viaarxiv icon