Picture for Qingfeng Sun

Qingfeng Sun

RubricBench: Aligning Model-Generated Rubrics with Human Standards

Add code
Mar 03, 2026
Viaarxiv icon

Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models

Add code
Mar 02, 2026
Viaarxiv icon

OffSeeker: Online Reinforcement Learning Is Not All You Need for Deep Research Agents

Add code
Jan 26, 2026
Viaarxiv icon

AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent

Add code
Dec 23, 2025
Viaarxiv icon

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Add code
May 21, 2025
Viaarxiv icon

WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models

Add code
Dec 23, 2024
Viaarxiv icon

Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models

Add code
Dec 22, 2024
Figure 1 for Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models
Figure 2 for Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models
Figure 3 for Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models
Figure 4 for Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models
Viaarxiv icon

AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation

Add code
Aug 01, 2024
Viaarxiv icon

Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena

Add code
Jul 15, 2024
Figure 1 for Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena
Figure 2 for Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena
Figure 3 for Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena
Figure 4 for Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena
Viaarxiv icon

WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

Add code
Aug 18, 2023
Figure 1 for WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Figure 2 for WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Figure 3 for WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Figure 4 for WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Viaarxiv icon