Tue Le

SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios

Dec 23, 2025
Via arXiv