feat: add kubernetes execution engine by clementblaise · Pull Request #378 · ridgesai/ridges

clementblaise · 2026-05-14T10:16:30Z

Summary

Introduces a Kubernetes-native execution engine as an alternative to the existing Docker Compose
backend. Instead of building and running task containers locally, the screener and validator now
submit Pods to a Kubernetes cluster where:

On-demand image builds via Kaniko — Task archives are fetched from S3 at Pod start time.
A Kaniko init container builds the image from the archive's Dockerfile and pushes it to an
in-cluster pull-through registry (Zot). No shared PVCs, no pre-building pipeline.
Proxy sidecar with iptables redirection — Each task Pod includes an init-iptables
container (NET_ADMIN) that transparently redirects port-443 traffic to the proxy's SNI router
on port 15443 (UID 1337 is exempt so the proxy itself can reach the internet). The proxy
enforces cost budgets, model restrictions, and OpenRouter workspace safety policies.
NetworkPolicy-based egress isolation — During the agent phase, only traffic to the proxy
is allowed. During verification, egress is unlocked so test harnesses can reach external
services. Phase transitions are controlled via Pod labels.
Commit hash resolution in containers — The API container previously reported
COMMIT_HASH=unknown (from the Dockerfile ARG default), causing every local screener
registration to fail with a hash mismatch. A three-tier resolution strategy now tries:
GIT_COMMIT env var → git rev-parse HEAD → pure-Python .git/HEAD reader. The local
docker-compose mounts .git as a read-only volume so the container can read the actual SHA
without needing the git binary installed.
KEDA autoscaling support — A /pending-work endpoint on the API returns the count of
queued evaluations so KEDA can scale screener StatefulSets to match demand.
Kind cluster manifests — Full local development environment with Kind config, Zot
registry, NetworkPolicies, and screener StatefulSets for testing the K8s backend without
cloud infrastructure.

Test plan

Local Kind cluster: make k8s-setup → run screener against a simple task → verify proxy logs show successful OpenRouter forwarding
Docker Compose (regression): docker-compose up → run local screener → verify commit hash matches and registration succeeds
Verify /pending-work endpoint returns correct counts with queued evaluations

…inerized builds

…Makefile targets

…car, and NetworkPolicy isolatio

…nishes normally

clementblaise marked this pull request as ready for review May 14, 2026 10:16

clementblaise added 8 commits May 15, 2026 16:37

feat: add Dockerfiles, GitHub Actions CI, and .dockerignore for conta…

50159b9

…inerized builds

fix: registry org

4942098

feat: add Kind cluster manifests, Zot registry, NetworkPolicies, and …

db28455

…Makefile targets

feat: add Kubernetes execution backend with Kaniko builds, proxy side…

24083dc

…car, and NetworkPolicy isolatio

feat: integrate Kubernetes backend into validator main loop and runner

b2372a5

feat: add /pending-work scaling endpoint for KEDA autoscaling

ef1c40a

chore: add .idea/ to gitignore

1187551

chore: remove build of proxy and artifact ref, add proxy env

e93278a

clementblaise force-pushed the k8s-execution-engine branch from daec9db to e93278a Compare May 15, 2026 08:44

clementblaise added 8 commits May 15, 2026 17:23

fix: cancellation poll task is never cancelled when the evaluation fi…

294a9af

…nishes normally

feat: add pull through registry

7dfe007

fix: use registry ssl in prod

f32bb8d

chore: add differ shutdown when eval

43a4912

chore: ruff format

fea8a49

fix: commit sha in docker

d39c90c

chore: ruff

d8f5995

chore: ruff

b91eaa5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add kubernetes execution engine#378

feat: add kubernetes execution engine#378
clementblaise wants to merge 16 commits into
mainfrom
k8s-execution-engine

clementblaise commented May 14, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

clementblaise commented May 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

clementblaise commented May 14, 2026 •

edited

Loading