Zhiqiang Zhang

Arrows of Math Reasoning Data Synthesis for Large Language Models: Diversity, Complexity and Correctness

Aug 26, 2025, via arXiv

A Deep Learning Pipeline Using Synthetic Data to Improve Interpretation of Paper ECG Images

Jul 29, 2025, via arXiv

Towards Greater Leverage: Scaling Laws for Efficient Mixture-of-Experts Language Models

Jul 24, 2025, via arXiv

Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning

Jul 24, 2025, via arXiv

WSM: Decay-Free Learning Rate Schedule via Checkpoint Merging for LLM Pre-training

Jul 23, 2025, via arXiv

Enhancing Cross-task Transfer of Large Language Models via Activation Steering

Jul 17, 2025, via arXiv

Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs

Jun 18, 2025, via arXiv

LogicCat: A Chain-of-Thought Text-to-SQL Benchmark for Multi-Domain Reasoning Challenges

May 24, 2025, via arXiv

Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning

May 22, 2025, via arXiv

SHARP: Synthesizing High-quality Aligned Reasoning Problems for Large Reasoning Models Reinforcement Learning

May 21, 2025, via arXiv