Picture for Chao Peng

Chao Peng

Richer Representations for Neural Algorithmic Reasoning via Auxiliary Reconstruction

Add code
May 30, 2026
Viaarxiv icon

SetupX: Can LLM Agents Learn from Past Failures in Functionality-Correct Code Repository Setup?

Add code
May 27, 2026
Viaarxiv icon

TerminalWorld: Benchmarking Agents on Real-World Terminal Tasks

Add code
May 21, 2026
Viaarxiv icon

Evaluating Repository-level Software Documentation via Question Answering and Feature-Driven Development

Add code
Apr 08, 2026
Viaarxiv icon

Evaluating LLM-Based 0-to-1 Software Generation in End-to-End CLI Tool Scenarios

Add code
Apr 08, 2026
Viaarxiv icon

Rethinking the Value of Agent-Generated Tests for LLM-Based Software Engineering Agents

Add code
Feb 08, 2026
Viaarxiv icon

Yunjue Agent Tech Report: A Fully Reproducible, Zero-Start In-Situ Self-Evolving Agent System for Open-Ended Tasks

Add code
Jan 26, 2026
Viaarxiv icon

Self-Augmented Mixture-of-Experts for QoS Prediction

Add code
Jan 16, 2026
Viaarxiv icon

Combating Spurious Correlations in Graph Interpretability via Self-Reflection

Add code
Jan 16, 2026
Viaarxiv icon

AdaGReS:Adaptive Greedy Context Selection via Redundancy-Aware Scoring for Token-Budgeted RAG

Add code
Dec 31, 2025
Viaarxiv icon