Picture for Tong Zhang

Tong Zhang

Nanjing University of Science and Technology, Nanjing, China

Teaching According to Students' Aptitude: Personalized Mathematics Tutoring via Persona-, Memory-, and Forgetting-Aware LLMs

Add code
Nov 19, 2025
Viaarxiv icon

Free Lunch to Meet the Gap: Intermediate Domain Reconstruction for Cross-Domain Few-Shot Learning

Add code
Nov 18, 2025
Viaarxiv icon

Lean4Physics: Comprehensive Reasoning Framework for College-level Physics in Lean4

Add code
Oct 30, 2025
Viaarxiv icon

Edge Collaborative Gaussian Splatting with Integrated Rendering and Communication

Add code
Oct 26, 2025
Viaarxiv icon

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Add code
Oct 14, 2025
Viaarxiv icon

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

Add code
Oct 06, 2025
Viaarxiv icon

Generalizable Geometric Image Caption Synthesis

Add code
Sep 18, 2025
Viaarxiv icon

Theoretical Analysis on how Learning Rate Warmup Accelerates Convergence

Add code
Sep 09, 2025
Viaarxiv icon

CANDY: Benchmarking LLMs' Limitations and Assistive Potential in Chinese Misinformation Fact-Checking

Add code
Sep 04, 2025
Figure 1 for CANDY: Benchmarking LLMs' Limitations and Assistive Potential in Chinese Misinformation Fact-Checking
Figure 2 for CANDY: Benchmarking LLMs' Limitations and Assistive Potential in Chinese Misinformation Fact-Checking
Figure 3 for CANDY: Benchmarking LLMs' Limitations and Assistive Potential in Chinese Misinformation Fact-Checking
Figure 4 for CANDY: Benchmarking LLMs' Limitations and Assistive Potential in Chinese Misinformation Fact-Checking
Viaarxiv icon

Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training

Add code
Sep 03, 2025
Figure 1 for Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training
Figure 2 for Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training
Figure 3 for Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training
Figure 4 for Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training
Viaarxiv icon