MoE AITER Triton Kernels Integration#1044
Open
amirumoAMD wants to merge 35 commits into
Open
Conversation
f18e6a3 to
1b9fd3b
Compare
valarLip
reviewed
Jun 3, 2026
6087e58 to
9268382
Compare
1 task
…ration, compiles on profile_offline similar to without triton enabled
…16w4 + expt data setup changes
…ports from triton_kernels
…ue to routing from topk handling
785a0bc to
71d4458
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Motivation
Replace triton_kernels module with aiter kernels. Add support for gpt-oss a8w4.
Technical Details
matmul_ogs now replaced by a16w4 moe gemm from aiter. custom routing redirects to updated expanded/unified aiter routing function. correlates to changes on aiter branch amemoore/gfx950-moe-triton-integration.
Test Plan
lm_eval matches expected result of non-triton run for mxfp4 weight models (DSr1 mxfp4, gpt-oss a8w4, gpt-oss regular).
Test Result
All three match
Submission Checklist