Tobias Lindenbauer

From Knowledge to Noise: CTIM-Rover and the Pitfalls of Episodic Memory in Software Engineering Agents

May 29, 2025
Via arXiv

GitGoodBench: A Novel Benchmark For Evaluating Agentic Performance On Git

May 28, 2025
Via arXiv