Picture for Minh V. T. Thai

Minh V. T. Thai

SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios

Add code
Dec 23, 2025
Figure 1 for SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios
Figure 2 for SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios
Figure 3 for SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios
Figure 4 for SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios
Viaarxiv icon