Benchmark pipeline for evaluating file-level localization in repository-level LLM repair on SWE-bench Verified tasks.
python benchmarking automated-program-repair empirical-software-engineering code-repair vllm llm-agents swe-bench mini-swe-agent issue-localization
-
Updated
May 17, 2026 - Python