Add Qwen2.5-VL-3B-Instruct model knowledge (vlm + verified perf) by rjckkkkk · Pull Request #89 · Approaching-AI/AIMA

rjckkkkk · 2026-06-11T07:28:25Z

What

Catalog model yaml for Qwen2.5-VL-3B-Instruct (type: vlm, aliases: [Qwen2.5-VL-3B-Instruct-Q4_K_M], openclaw.chat_provider: true, a universal llamacpp gguf variant, verified Strix Halo perf).

Why

Without a catalog entry the scanned GGUF was auto-detected with no type, so aima openclaw sync hit its default (skip) branch → the deployed model never became an OpenClaw provider (sync reported vlm=0). With the yaml, the model has a type → sync writes it as an aima provider and sets it as the default chat model.

Verified (AMD Strix Halo Win11 rig, llama.cpp b9330)

Deployed → text decode ~90 tok/s; vision verified (started with --mmproj, correctly identified a test image). After this yaml, aima openclaw sync → providers=aima, model qwen2.5-vl-3b-instruct, chatModel=aima/qwen2.5-vl-3b-instruct.

Note: image input requires llama-server's --mmproj (pass --config mmproj=<path>); auto-wiring the projector is a separate enhancement.

🤖 Generated with Claude Code

… perf) Without a catalog entry the scanned GGUF (Qwen2.5-VL-3B-Instruct-Q4_K_M) was auto-detected with no type, so `openclaw sync` skipped it (default case) and the model never surfaced as an OpenClaw provider. Add the model yaml: type vlm, metadata.aliases for the scan name, openclaw.chat_provider so it surfaces as a multimodal (text+image) chat provider, a universal llamacpp gguf variant, and verified perf (AMD Strix Halo / llama.cpp b9330: ~90 tok/s text decode; vision verified with --mmproj). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

) New version-stamped build aima-windows-amd64-v0.5-dev-amd-strix-halo-20260612.exe (source commit fa35aa4) on top of the HIP-engine build. Adds, vs the 20260610 exe: #87 native deploy readiness uses the real runtime name (no false "not ready") #88 deploy launcher hidden (no cmd.exe console popup) via VBS launcher #89 Qwen2.5-VL-3B-Instruct catalog knowledge (vlm + aliases + verified perf) #90 zero-config vision: llama.cpp --mmproj auto-wired for VL gguf models #91 openclaw sync preflight-probes :6188 and warns loudly when unreachable serve.bat now points at the 20260612 exe; older builds kept for rollback. README build table + fixes list + OpenClaw data-plane guidance updated. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Qwen2.5-VL-3B-Instruct model knowledge (vlm + verified perf)#89

Add Qwen2.5-VL-3B-Instruct model knowledge (vlm + verified perf)#89
rjckkkkk wants to merge 1 commit into
developfrom
feat/qwen2.5-vl-3b-model-knowledge

rjckkkkk commented Jun 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

rjckkkkk commented Jun 11, 2026

What

Why

Verified (AMD Strix Halo Win11 rig, llama.cpp b9330)

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant