Picture for Tinghong Chen

Tinghong Chen

College of AI, Tsinghua University and Shanghai Qi Zhi Institute

Data Difficulty and the Generalization--Extrapolation Tradeoff in LLM Fine-Tuning

Add code
May 13, 2026
Viaarxiv icon

MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent

Add code
Jul 03, 2025
Figure 1 for MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
Figure 2 for MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
Figure 3 for MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
Figure 4 for MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
Viaarxiv icon

SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning

Add code
Jun 24, 2025
Viaarxiv icon