[Skill proposal] `serving-llms-on-instinct`

### Proposed skill name

serving-llms-on-instinct

### Does something like this already exist?

Yes — as documentation, a runbook, or internal guide

### Where should this skill live?

Path B: authored in a product repo (HIP, ROCm, Ryzen AI, Lemonade, ...) and registered here

### Catalog focus area

Cross-stack porting

### Skill description

**Description**: Deploy and optimize LLM inference on AMD Instinct GPUs. Covers the full path from "I want to serve a model" to a running, benchmarked endpoint, including a DevCloud on-ramp for developers who don't have AMD hardware yet.

**Flow**: Trigger run -> Detect GPU ( if not found, trigger AMD Developer cloud setup) -> Decide VLLM vs SGLang Engine selection, and its Attention backends ( AITER, FA etc) ->Quark -> Env Vars -> Runtime


<img width="111" height="150" alt="Image" src="https://github.com/user-attachments/assets/11ca63f6-d81d-4300-9202-5c8389909263" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Skill proposal] `serving-llms-on-instinct` #30

Proposed skill name

Does something like this already exist?

Where should this skill live?

Catalog focus area

Skill description

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

[Skill proposal] serving-llms-on-instinct #30

Description

Proposed skill name

Does something like this already exist?

Where should this skill live?

Catalog focus area

Skill description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions

[Skill proposal] `serving-llms-on-instinct` #30