Picture for Justine Gehring

Justine Gehring

FreshBrew: A Benchmark for Evaluating AI Agents on Java Code Migration

Add code
Oct 06, 2025
Viaarxiv icon

GitChameleon: Evaluating AI Code Generation Against Python Library Version Incompatibilities

Add code
Jul 16, 2025
Viaarxiv icon