-
Notifications
You must be signed in to change notification settings - Fork 10
Expand file tree
/
Copy pathCITATION.cff
More file actions
40 lines (40 loc) · 1.23 KB
/
CITATION.cff
File metadata and controls
40 lines (40 loc) · 1.23 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
title: "SlopCodeBench: Measuring Code Erosion Under Iterative Specification Refinement"
type: article
authors:
- family-names: Orlanski
given-names: Gabriel
- family-names: Roy
given-names: Devjeet
- family-names: Yun
given-names: Alexander
- family-names: Shin
given-names: Changho
- family-names: Gu
given-names: Alex
- family-names: Ge
given-names: Albert
- family-names: Adila
given-names: Dyah
- family-names: Albarghouthi
given-names: Aws
- family-names: Sala
given-names: Frederic
doi: 10.5281/zenodo.19257129
date-released: 2025-12-18
url: "https://arxiv.org/abs/2603.24755"
repository-code: "https://github.com/SprocketLab/slop-code-bench"
license: MIT
keywords:
- coding-agents
- benchmark
- evaluation
- specification-refinement
- code-quality
abstract: >-
SlopCodeBench evaluates coding agents under iterative specification
refinement: the agent implements a spec, then extends its own code as
the spec changes. This exposes behaviors that single-shot benchmarks
cannot measure, including path dependence, non-convergence, and
trade-offs between explicit handling and structural stability.