Skip to content

[QST] I am writing a mix-dtype input gemm for blackwell but no example #3138

@haowen-han

Description

@haowen-han

What is your question?
I am developing a GEMM operator for the Blackwell GPU where the B matrix uses MXFP4 and the A matrix uses MXFP8. However, there are no relevant examples for either CUTE C++ or CUTE DSL. What's more, CUTE DSL even requires the data types of the A and B matrices input to MMA to be identical, but in reality, the Blackwell architecture supports mismatched data types. Could anyone provide relevant examples?

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions