Picture for Wei Shen

Wei Shen

Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning

Add code
Apr 23, 2025
Viaarxiv icon

Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model

Add code
Apr 22, 2025
Viaarxiv icon

LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs

Add code
Apr 20, 2025
Viaarxiv icon

A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future

Add code
Apr 12, 2025
Viaarxiv icon

Generalized Tensor-based Parameter-Efficient Fine-Tuning via Lie Group Transformations

Add code
Apr 01, 2025
Viaarxiv icon

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Add code
Mar 31, 2025
Viaarxiv icon

Dereflection Any Image with Diffusion Priors and Diversified Data

Add code
Mar 21, 2025
Viaarxiv icon

Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding

Add code
Mar 18, 2025
Viaarxiv icon

A Token-level Text Image Foundation Model for Document Understanding

Add code
Mar 04, 2025
Viaarxiv icon

MDN: Mamba-Driven Dualstream Network For Medical Hyperspectral Image Segmentation

Add code
Feb 24, 2025
Viaarxiv icon