Shikun Zhang

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective

May 23, 2025
Via arXiv

VLM-R$^3$: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought

May 22, 2025
Via arXiv

MPL: Multiple Programming Languages with Large Language Models for Information Extraction

May 22, 2025
Via arXiv

Mitigating Spurious Correlations with Causal Logit Perturbation

May 21, 2025
Via arXiv

Can You Really Trust Code Copilots? Evaluating Large Language Models from a Code Security Perspective

May 15, 2025
Via arXiv

3D Surface Reconstruction with Enhanced High-Frequency Details

May 06, 2025
Via arXiv

SaRO: Enhancing LLM Safety through Reasoning-based Alignment

Apr 13, 2025
Via arXiv

SampleMix: A Sample-wise Pre-training Data Mixing Strategy by Coordinating Data Quality and Diversity

Mar 03, 2025
Via arXiv

Outcome-Refining Process Supervision for Code Generation

Dec 19, 2024
Via arXiv

SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization

Nov 17, 2024
Via arXiv