Shujian Huang

Neuron-Aware Data Selection In Instruction Tuning For Large Language Models

Mar 13, 2026

ExpLang: Improved Exploration and Exploitation in LLM Reasoning with On-Policy Thinking Language Selection

Feb 25, 2026

GRRM: Group Relative Reward Modeling for Machine Translation

Feb 15, 2026

Self-Improving Multilingual Long Reasoning via Translation-Reasoning Integrated Training

Feb 05, 2026

PEGRL: Improving Machine Translation by Post-Editing Guided Reinforcement Learning

Feb 03, 2026

Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers

Jan 29, 2026

Align to the Pivot: Dual Alignment with Self-Feedback for Multilingual Math Reasoning

Jan 25, 2026

Making Mathematical Reasoning Adaptive

Oct 06, 2025

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Aug 20, 2025

How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective

May 27, 2025