Picture for Zhijiang Guo

Zhijiang Guo

DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning

Add code
Mar 11, 2026
Viaarxiv icon

Efficient RLVR Training via Weighted Mutual Information Data Selection

Add code
Mar 02, 2026
Viaarxiv icon

When Silence Is Golden: Can LLMs Learn to Abstain in Temporal QA and Beyond?

Add code
Feb 04, 2026
Viaarxiv icon

Accordion-Thinking: Self-Regulated Step Summaries for Efficient and Readable LLM Reasoning

Add code
Feb 03, 2026
Viaarxiv icon

Merging Beyond: Streaming LLM Updates via Activation-Guided Rotations

Add code
Feb 03, 2026
Viaarxiv icon

ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall

Add code
Oct 09, 2025
Figure 1 for ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall
Figure 2 for ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall
Figure 3 for ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall
Figure 4 for ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall
Viaarxiv icon

When Inverse Data Outperforms: Exploring the Pitfalls of Mixed Data in Multi-Stage Fine-Tuning

Add code
Sep 16, 2025
Viaarxiv icon

ClimateViz: A Benchmark for Statistical Reasoning and Fact Verification on Scientific Charts

Add code
Jun 11, 2025
Viaarxiv icon

TreeReview: A Dynamic Tree of Questions Framework for Deep and Efficient LLM-based Scientific Peer Review

Add code
Jun 09, 2025
Figure 1 for TreeReview: A Dynamic Tree of Questions Framework for Deep and Efficient LLM-based Scientific Peer Review
Figure 2 for TreeReview: A Dynamic Tree of Questions Framework for Deep and Efficient LLM-based Scientific Peer Review
Figure 3 for TreeReview: A Dynamic Tree of Questions Framework for Deep and Efficient LLM-based Scientific Peer Review
Figure 4 for TreeReview: A Dynamic Tree of Questions Framework for Deep and Efficient LLM-based Scientific Peer Review
Viaarxiv icon

TreeRPO: Tree Relative Policy Optimization

Add code
Jun 05, 2025
Viaarxiv icon