Picture for Zhijiang Guo

Zhijiang Guo

TreeReview: A Dynamic Tree of Questions Framework for Deep and Efficient LLM-based Scientific Peer Review

Add code
Jun 09, 2025
Viaarxiv icon

TreeRPO: Tree Relative Policy Optimization

Add code
Jun 05, 2025
Viaarxiv icon

SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving

Add code
May 29, 2025
Viaarxiv icon

AVerImaTeC: A Dataset for Automatic Verification of Image-Text Claims with Evidence from the Web

Add code
May 23, 2025
Viaarxiv icon

Activation-Guided Consensus Merging for Large Language Models

Add code
May 20, 2025
Viaarxiv icon

TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarios

Add code
May 19, 2025
Viaarxiv icon

EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code

Add code
May 19, 2025
Viaarxiv icon

From System 1 to System 2: A Survey of Reasoning Large Language Models

Add code
Feb 25, 2025
Viaarxiv icon

RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems?

Add code
Jan 20, 2025
Viaarxiv icon

Long$^2$RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall

Add code
Oct 31, 2024
Viaarxiv icon