Picture for Keqin Bao

Keqin Bao

additional authors not shown

Unified Data Selection for LLM Reasoning

Add code
May 21, 2026
Viaarxiv icon

On Predicting the Post-training Potential of Pre-trained LLMs

Add code
May 12, 2026
Viaarxiv icon

SkillGraph: Skill-Augmented Reinforcement Learning for Agents via Evolving Skill Graphs

Add code
May 12, 2026
Viaarxiv icon

SAGE: Scalable Automated Robustness Augmentation for LLM Knowledge Evaluation

Add code
May 12, 2026
Viaarxiv icon

Towards Sample-Efficient and Stable Reinforcement Learning for LLM-based Recommendation

Add code
Jan 31, 2026
Viaarxiv icon

Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code

Add code
Jul 10, 2025
Figure 1 for Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code
Figure 2 for Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code
Figure 3 for Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code
Figure 4 for Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code
Viaarxiv icon

Boosting Parameter Efficiency in LLM-Based Recommendation through Sophisticated Pruning

Add code
Jul 09, 2025
Viaarxiv icon

CoRT: Code-integrated Reasoning within Thinking

Add code
Jun 12, 2025
Viaarxiv icon

MTR-Bench: A Comprehensive Benchmark for Multi-Turn Reasoning Evaluation

Add code
May 26, 2025
Viaarxiv icon

Qwen3 Technical Report

Add code
May 14, 2025
Figure 1 for Qwen3 Technical Report
Figure 2 for Qwen3 Technical Report
Figure 3 for Qwen3 Technical Report
Figure 4 for Qwen3 Technical Report
Viaarxiv icon