CI: add a16w16 GEMM to op tuning workflow#3605
Conversation
🏷️ CI GuideRuns automatically on every PR:
Extended tests (opt-in via labels):
|
There was a problem hiding this comment.
Pull request overview
This PR extends the manual “Operators Tuning” workflow to include the BF16 GEMM tuner (gemm_a16w16) so the workflow can refresh aiter/configs/bf16_tuned_gemm.csv from aiter/configs/bf16_untuned_gemm.csv via the existing op_tune.sh orchestration.
Changes:
- Add a new
gemm_a16w16entry to.github/scripts/op_tune.shso it can be selected/run like other tuning shapes. - Update the workflow-dispatch
shapesinput description to includegemm_a16w16in the examples.
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.
| File | Description |
|---|---|
.github/workflows/operators-tuning.yaml |
Updates the manual workflow input help text to include gemm_a16w16 among example shapes. |
.github/scripts/op_tune.sh |
Adds gemm_a16w16 to the tuning job list, wiring it to the existing BF16 GEMM untuned/tuned CSV paths. |
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
| declare -a tune_jobs=( | ||
| "ck_batched_gemm_a8w8:csrc/ck_batched_gemm_a8w8:op_tests/test_batched_gemm_a8w8.py:python3 csrc/ck_batched_gemm_a8w8/batched_gemm_a8w8_tune.py -i aiter/configs/a8w8_untuned_batched_gemm.csv -o aiter/configs/a8w8_tuned_batched_gemm.csv" | ||
| "ck_batched_gemm_bf16:csrc/ck_batched_gemm_bf16:op_tests/test_batched_gemm_bf16.py:python3 csrc/ck_batched_gemm_bf16/batched_gemm_bf16_tune.py -i aiter/configs/bf16_untuned_batched_gemm.csv -o aiter/configs/bf16_tuned_batched_gemm.csv" | ||
| "gemm_a16w16:csrc/gemm_a16w16:op_tests/test_gemm_a16w16.py:python3 csrc/gemm_a16w16/gemm_a16w16_tune.py -i aiter/configs/bf16_untuned_gemm.csv -o aiter/configs/bf16_tuned_gemm.csv" |
There was a problem hiding this comment.
use this, python3 csrc/gemm_a16w16/gemm_tuner.py -i aiter/configs/bf16_untuned_gemm.csv -o aiter/configs/bf16_tuned_gemm.csv. Both are supported, but this can support tune with hipblaslt
Summary
gemm_a16w16job to the operator tuning script so bf16 GEMM tuning can refreshbf16_tuned_gemm.csvfrombf16_untuned_gemm.csv.gemm_a16w16in the manual Operators Tuning workflow shape examples.Test plan
bash -n .github/scripts/op_tune.shgit diff --check