Picture for Shuhao Guan

Shuhao Guan

$λ$-GRPO: Unifying the GRPO Frameworks with Learnable Token Preferences

Add code
Oct 08, 2025
Figure 1 for $λ$-GRPO: Unifying the GRPO Frameworks with Learnable Token Preferences
Figure 2 for $λ$-GRPO: Unifying the GRPO Frameworks with Learnable Token Preferences
Figure 3 for $λ$-GRPO: Unifying the GRPO Frameworks with Learnable Token Preferences
Figure 4 for $λ$-GRPO: Unifying the GRPO Frameworks with Learnable Token Preferences
Viaarxiv icon

DCR: Quantifying Data Contamination in LLMs Evaluation

Add code
Jul 15, 2025
Viaarxiv icon

PreP-OCR: A Complete Pipeline for Document Image Restoration and Enhanced OCR Accuracy

Add code
May 28, 2025
Viaarxiv icon

UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models

Add code
May 26, 2025
Viaarxiv icon

Stochastic Weight Sharing for Bayesian Neural Networks

Add code
May 23, 2025
Viaarxiv icon

Advancing Post-OCR Correction: A Comparative Study of Synthetic Data

Add code
Aug 05, 2024
Viaarxiv icon

Benchmark Data Contamination of Large Language Models: A Survey

Add code
Jun 06, 2024
Viaarxiv icon