Picture for Guoxin Chen

Guoxin Chen

JURY-RL: Votes Propose, Proofs Dispose for Label-Free RLVR

Add code
Apr 28, 2026
Viaarxiv icon

Toward Autonomous Long-Horizon Engineering for ML Research

Add code
Apr 14, 2026
Viaarxiv icon

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

Add code
Mar 03, 2026
Viaarxiv icon

SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training

Add code
Feb 03, 2026
Viaarxiv icon

LLM-in-Sandbox Elicits General Agentic Intelligence

Add code
Jan 22, 2026
Viaarxiv icon

IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction

Add code
Nov 10, 2025
Viaarxiv icon

MARS: Optimizing Dual-System Deep Research via Multi-Agent Reinforcement Learning

Add code
Oct 06, 2025
Viaarxiv icon

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

Add code
Sep 16, 2025
Figure 1 for WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Figure 2 for WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Figure 3 for WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Figure 4 for WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Viaarxiv icon

From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization

Add code
Jul 09, 2025
Figure 1 for From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization
Figure 2 for From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization
Figure 3 for From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization
Figure 4 for From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization
Viaarxiv icon

Table-Critic: A Multi-Agent Framework for Collaborative Criticism and Refinement in Table Reasoning

Add code
Feb 17, 2025
Viaarxiv icon