Picture for Qingfeng Sun

Qingfeng Sun

OffSeeker: Online Reinforcement Learning Is Not All You Need for Deep Research Agents

Add code
Jan 26, 2026
Viaarxiv icon

AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent

Add code
Dec 23, 2025
Viaarxiv icon

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Add code
May 21, 2025
Viaarxiv icon

WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models

Add code
Dec 23, 2024
Viaarxiv icon

Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models

Add code
Dec 22, 2024
Figure 1 for Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models
Figure 2 for Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models
Figure 3 for Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models
Figure 4 for Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models
Viaarxiv icon

AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation

Add code
Aug 01, 2024
Viaarxiv icon

Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena

Add code
Jul 15, 2024
Figure 1 for Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena
Figure 2 for Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena
Figure 3 for Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena
Figure 4 for Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena
Viaarxiv icon

WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

Add code
Aug 18, 2023
Figure 1 for WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Figure 2 for WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Figure 3 for WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Figure 4 for WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Viaarxiv icon

WizardCoder: Empowering Code Large Language Models with Evol-Instruct

Add code
Jun 14, 2023
Viaarxiv icon

Self-Supervised Multi-Modal Sequential Recommendation

Add code
Apr 26, 2023
Figure 1 for Self-Supervised Multi-Modal Sequential Recommendation
Figure 2 for Self-Supervised Multi-Modal Sequential Recommendation
Figure 3 for Self-Supervised Multi-Modal Sequential Recommendation
Figure 4 for Self-Supervised Multi-Modal Sequential Recommendation
Viaarxiv icon