[VectorDistribution] Reuse intrinsic layout in chained gemm #18505

Groverkss · 2024-09-12T14:46:21Z

This patch teaches the attention codegen pipeline to reuse the intrinsic layout of the output of the first matmul as the LHS of the second matmul. This is possible for the 16x16x16 and 32x32x8 MFMA intrinsic layouts.
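To illustrate why such reuse can work, here is a minimal Python sketch for the 16x16x16 case. It is not the code from this patch. The lane-to-element mappings (`c_layout`, `a_layout`) are assumptions based on the commonly documented register layout of the f16 16x16x16 MFMA on a 64-lane wavefront, and all function names are hypothetical. The sketch only checks that, under those assumptions, every lane's accumulator elements sit exactly where the LHS operand expects them once the tile is viewed transposed, so no cross-lane data movement would be needed between the two matmuls.

```python
# Sketch only: the lane mappings below are assumptions about the register
# layout of the 16x16x16 f16 MFMA; they are not taken from the patch.

LANES = 64          # wavefront size
ELEMS_PER_LANE = 4  # values each lane holds per operand tile


def c_layout(lane, j):
    """(row, col) of the j-th accumulator element held by `lane` (M x N tile)."""
    return (4 * (lane // 16) + j, lane % 16)


def a_layout(lane, j):
    """(row, col) of the j-th LHS element held by `lane` (M x K tile)."""
    return (lane % 16, 4 * (lane // 16) + j)


def reusable_as_lhs():
    """True if the transposed accumulator tile lands exactly on the LHS layout."""
    for lane in range(LANES):
        for j in range(ELEMS_PER_LANE):
            row, col = c_layout(lane, j)
            # Transposing the tile swaps the row and column coordinates.
            if (col, row) != a_layout(lane, j):
                return False
    return True


print(reusable_as_lhs())
```

Under the same kind of assumption, the 32x32x8 accumulator would presumably need to be treated as several K=8 slices before a similar check applies; the sketch does not cover that case.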

compiler/src/iree/compiler/Codegen/Common/GPU/GPUNestedLayoutDistributionPatterns.cpp

compiler/src/iree/compiler/Codegen/LLVMGPU/LLVMGPUConfigureTensorLayouts.cpp

compiler/src/iree/compiler/Codegen/LLVMGPU/test/ROCDL/pipeline_vector_distribute_gfx940.mlir

kuhar

LGTM, but you may want to wait for a thumbs up from Quinn/Mahesh too.

Groverkss added 7 commits September 19, 2024 10:19

reuse intrinsic

fedb374

Fix 32x32x8 intrinsic

5093429

bit more docs

a6cf102

BAZEL

8ebd127

Fix tests

7172237

rebase

ba5f0a2

Add tests

d8f7aad

Groverkss force-pushed the users/Groverkss/reuse-attention-intrinsic branch from 67a72fa to d8f7aad September 19, 2024 10:50

Groverkss marked this pull request as ready for review September 19, 2024 10:54

Groverkss requested review from MaheshRavishankar, qedawkins, kuhar and antiagainst as code owners September 19, 2024 10:54

kuhar reviewed Sep 19, 2024

Address comments

5281d92

Groverkss requested a review from kuhar September 19, 2024 20:09

kuhar approved these changes Sep 20, 2024

MaheshRavishankar approved these changes Sep 20, 2024

Groverkss merged commit 914858f into iree-org:main Sep 20, 2024
33 checks passed
