Skip to content

[CIR][CUDA] multi-arch support for the CIR offload-merge pipeline#9

Open
RiverDave wants to merge 1 commit into
06-18-_cir_introduce_the_offload_merge_driver_pipeline_for_cuda_single_arch_from
users/riverdave/cir/cuda-offload-merge-multi-arch
Open

[CIR][CUDA] multi-arch support for the CIR offload-merge pipeline#9
RiverDave wants to merge 1 commit into
06-18-_cir_introduce_the_offload_merge_driver_pipeline_for_cuda_single_arch_from
users/riverdave/cir/cuda-offload-merge-multi-arch

Conversation

@RiverDave

@RiverDave RiverDave commented Jun 27, 2026

Copy link
Copy Markdown
Owner

Depends on: #7

This adds CUDA multi-arch support to the CIR offload-merge pipeline.

The stock CUDA driver compiles each GPU arch separately, then collapses the per-arch device outputs into one CUDA fatbinary before passing it to the host via -fcuda-include-gpubinary. The CIR merge path now mirrors that: after splitting the combined CIR container, each device module resumes backend/assemble for its own arch, and the resulting outputs are packaged into one fatbin for the resumed host compile.

This also makes OffloadAction preserve the bound arch per device dependence. The split action is a shared node, so the arch cannot live only on the action itself; each dependence needs to carry the arch it is selecting from the split.

Copy link
Copy Markdown
Owner Author

This stack of pull requests is managed by Graphite. Learn more about stacking.

@RiverDave RiverDave changed the title [CIR][CUDA] Support multi-arch offload merge fatbins [CIR][CUDA] multi-arch support to the CIR offload-merge pipeline Jun 27, 2026
@RiverDave RiverDave changed the title [CIR][CUDA] multi-arch support to the CIR offload-merge pipeline [CIR][CUDA] multi-arch support for the CIR offload-merge pipeline Jun 27, 2026
@RiverDave RiverDave requested a review from koparasy June 27, 2026 09:42
@RiverDave RiverDave marked this pull request as ready for review June 27, 2026 09:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant