[Codegen][GPU] Finish splitting NV intrinsics from AMD ones #19853

qedawkins · 2025-01-30T16:21:20Z

The split of WMMA_* enums into Nvidia and AMD variants was half finished. This completely splits the handling of each vendor. In the process, because concrete layouts for nvidia intrinsics is unimplemented, the only supported case is opaque layouts via SPIR-V. This required re-introducing getMNKShape per enum value rather than inferring it from the layout.

This PR is effectively NFC, but unblocks enabling LLVMGPUTileAndFuse by default for matmuls.

The split of `WMMA_*` enums into Nvidia and AMD variants was half finished. This completely splits the handling of each vendor. In the process, because concrete layouts for nvidia intrinsics is unimplemented, the only supported case is opaque layouts via SPIR-V. This required re-introducing `getMNKShape` per enum value rather than inferring it from the layout. This PR is effectively NFC, but unblocks enabling LLVMGPUTileAndFuse by default for matmuls.

compiler/src/iree/compiler/Codegen/Dialect/GPU/IR/IREEGPUAttrs.cpp

…#19853) The split of `WMMA_*` enums into Nvidia and AMD variants was half finished. This completely splits the handling of each vendor. In the process, because concrete layouts for nvidia intrinsics is unimplemented, the only supported case is opaque layouts via SPIR-V. This required re-introducing `getMNKShape` per enum value rather than inferring it from the layout. This PR is effectively NFC, but unblocks enabling LLVMGPUTileAndFuse by default for matmuls. Signed-off-by: Hyunsung Lee <[email protected]>

qedawkins requested review from bjacob, Max191 and raikonenfnu January 30, 2025 16:21

qedawkins requested a review from antiagainst as a code owner January 30, 2025 16:21

bjacob reviewed Jan 30, 2025

View reviewed changes

compiler/src/iree/compiler/Codegen/Dialect/GPU/IR/IREEGPUAttrs.cpp Outdated Show resolved Hide resolved

keep single source of truth for amd

c3a566f

qedawkins requested a review from bjacob January 31, 2025 16:22

bjacob approved these changes Jan 31, 2025

View reviewed changes

qedawkins merged commit 0159762 into iree-org:main Jan 31, 2025
41 checks passed

qedawkins deleted the finish_nv_intrinsics branch January 31, 2025 19:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Codegen][GPU] Finish splitting NV intrinsics from AMD ones #19853

[Codegen][GPU] Finish splitting NV intrinsics from AMD ones #19853

qedawkins commented Jan 30, 2025

[Codegen][GPU] Finish splitting NV intrinsics from AMD ones #19853

[Codegen][GPU] Finish splitting NV intrinsics from AMD ones #19853

Conversation

qedawkins commented Jan 30, 2025