Xiaoqing Zheng

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

Jan 23, 2026
Via arXiv

Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction

Jan 08, 2026
Via arXiv

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Jan 07, 2026
Via arXiv

CSSG: Measuring Code Similarity with Semantic Graphs

Jan 07, 2026
Via arXiv

Enhancing Model Privacy in Federated Learning with Random Masking and Quantization

Aug 27, 2025
Via arXiv

Edge Intelligence with Spiking Neural Networks

Jul 18, 2025
Via arXiv

Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical Reasoning

Jun 04, 2025
Via arXiv

Improving Continual Pre-training Through Seamless Data Packing

May 29, 2025
Via arXiv

RECAST: Strengthening LLMs' Complex Instruction Following with Constraint-Verifiable Data

May 25, 2025
Via arXiv

Chain-of-Model Learning for Language Model

May 17, 2025
Via arXiv