Skip to content

Fix native deploy readiness: deploy.apply returned a name status lookup couldn't find#87

Open
rjckkkkk wants to merge 1 commit into
developfrom
fix/native-deploy-readiness-name
Open

Fix native deploy readiness: deploy.apply returned a name status lookup couldn't find#87
rjckkkkk wants to merge 1 commit into
developfrom
fix/native-deploy-readiness-name

Conversation

@rjckkkkk

Copy link
Copy Markdown
Collaborator

Problem

On Windows, deploying a model from the UI / onboarding (or aima run) showed "deployment started but not ready within 1m0s" even though the engine was actually serving fine — the model was reachable on its port the whole time.

Root cause: deploy.apply registers the native deployment under req.Name (== the model name — that's what deploy.list / undeploy use), but its result returned the sanitized pod name (SanitizePodName(model+"-"+engine), e.g. qwen2-5-vl-3b-instruct-q4-k-m-llamacpp) as "name". The run/onboarding readiness poller (deployRunCore.waitForDeployment) and the UI then polled deploy.status by that sanitized name, which the native runtime never registered → status was never ready → the loop ran to the timeout. The reuse path already returned the runtime name, so the two paths were inconsistent.

Fix

Return req.Name (the name the runtime actually keyed the deployment under) from the non-reuse path of deploy.apply, matching the reuse path and deploy.list/undeploy. One-field change in cmd/aima/tooldeps_deploy.go.

Verified on the AMD Strix Halo (Win11) rig

aima run Qwen2.5-VL-3B-Instruct-Q4_K_M"Ready!" in ~27 s (Endpoint http://127.0.0.1:8080, runtime native). Before the fix the identical command timed out at 60 s with "not ready within 1m" while the model was serving the whole time. Affects every native deploy via the UI/onboarding readiness path; K3S was unaffected (it keys by the sanitized pod name).

🤖 Generated with Claude Code

…detected

deploy.apply registered the native deployment under req.Name (== modelName) but
returned the sanitized pod name (model+engine) as "name". The run/onboarding
readiness poller and the UI then queried deploy.status by that sanitized name,
which the native runtime never registered → status was never found → the deploy
appeared stuck and timed out "not ready within 1m" even though the engine was
already serving. Return req.Name (the name the runtime actually keyed the
deployment under), matching the reuse path and deploy.list/undeploy.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
rjckkkkk added a commit that referenced this pull request Jun 12, 2026
)

New version-stamped build aima-windows-amd64-v0.5-dev-amd-strix-halo-20260612.exe
(source commit fa35aa4) on top of the HIP-engine build. Adds, vs the 20260610 exe:

  #87 native deploy readiness uses the real runtime name (no false "not ready")
  #88 deploy launcher hidden (no cmd.exe console popup) via VBS launcher
  #89 Qwen2.5-VL-3B-Instruct catalog knowledge (vlm + aliases + verified perf)
  #90 zero-config vision: llama.cpp --mmproj auto-wired for VL gguf models
  #91 openclaw sync preflight-probes :6188 and warns loudly when unreachable

serve.bat now points at the 20260612 exe; older builds kept for rollback.
README build table + fixes list + OpenClaw data-plane guidance updated.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant