I have figured out the problem: accumulating to <8xf16> is only available in wave64 mode; in wave32 mode the accumulator should be <16xf16>. The compiler selects wave32 mode by default for the ROCm backend, but the accumulator matrix spec tied to MMAIntrinsic::WMMA_F16_16x16x16_F16 for vector.contract only ever selects the <8xf16> accumulate strategy.
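To make the mismatch concrete, here is a minimal Python sketch of the check described above. It only encodes the two accumulator widths stated in this comment (<8xf16> for wave64, <16xf16> for wave32); the table and both helper names are hypothetical illustrations and are not part of IREE.

```python
# Accumulator vector length per lane for WMMA_F16_16x16x16_F16,
# as stated in the comment above. This table is illustrative only
# and is not read from IREE or LLVM.
ACC_VECTOR_LEN = {
    64: 8,   # wave64: accumulate into <8xf16>
    32: 16,  # wave32: accumulate into <16xf16>
}


def expected_acc_type(subgroup_size: int) -> str:
    """Return the accumulator vector type expected for a subgroup size."""
    if subgroup_size not in ACC_VECTOR_LEN:
        raise ValueError(f"unsupported subgroup size: {subgroup_size}")
    return f"vector<{ACC_VECTOR_LEN[subgroup_size]}xf16>"


def acc_type_matches(subgroup_size: int, actual_type: str) -> bool:
    """Check whether a chosen accumulator type fits the subgroup size."""
    return actual_type == expected_acc_type(subgroup_size)


if __name__ == "__main__":
    # The failing configuration from this issue: wave32 is selected,
    # but the layout still uses the wave64 accumulator width.
    print(expected_acc_type(32))
    print(acc_type_matches(32, "vector<8xf16>"))
```

With these assumptions, the wave32 default paired with the <8xf16> accumulator is reported as a mismatch, which is the situation this issue describes.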
The existing layout for the intrinsic was written for subgroup=64, but we are using subgroup=32, which led to the error in #18060.
This PR switches to the correct layout for subgroup=32 and therefore fixes #18060 and #17807.
What happened?
attention_and_matmul_spec_wmma.mlir: https://sharkpublic.blob.core.windows.net/sharkpublic/specs/no_pad/attention_and_matmul_spec_wmma.mlir
CLIP MLIR:
https://sharkpublic.blob.core.windows.net/sharkpublic/ean/sdxl-turbine/debug/stable_diffusion_xl_base_1_0_bs1_64_fp16_prompt_encoder_rocm.mlir
Steps to reproduce your issue
1. Set up the latest iree-compiler from the IREE main branch (issue reproduced on version iree-compiler-20240703.943).
2. Download the artifacts (MLIR files) from the Azure links above.
3. Run compile:
4. See the error.
What component(s) does this issue relate to?
No response
Version information
First encountered on a source build of the sdxl_quantized branch (6cc8afe), then reproduced on the latest published wheels (20240703.943).
Additional context
No response