Tile and distribute linalg.generic in DispatchLinalgOnTensor #5159
Conversation
Looks awesome! I am super excited about this landing! It addresses a lot of hidden issues by enabling this path.
Just a nit on the comments for setting the launch config. Once that is addressed, it is good to go.
ShapedType outputShape = op.getOutputShapedType(0);

SmallVector<int64_t, 4> sizes;
// When vectorization is not enabled the convertToGPU pass assumes that
Misleading comment: the ConvertToGPU pass does not assume a tile size, right? I am confused because the launch config isn't even called in the ConvertToGPU pass.
Good catch, you're right, I had the comment backward. The problem is that when vectorization is off, the second level of tiling is skipped and we end up mapping one element to one thread. So the real problem is that if we picked something different in LaunchConfig, the number of workgroups would be wrong. I updated the comment; hopefully it is clear now.
sizes.append({4 * subgroupSize, 2 * subgroupSize});
}
sizes.push_back(subgroupSize);
int64_t lowerTs = config.workgroupSize[0];
Could you add comments on what's happening here? It's a bit hard to parse.
if (isa<linalg::ConvInputNHWCFilterHWCFOp,
        linalg::DepthwiseConvInputNHWCFilterHWCOp>(op)) {
  count.erase(count.begin());
size_t numParrallelLoops = getNumOuterParallelLoops(op);
Oh nice! This is exactly what I was trying to do. Should work well with Hanhan's PR #5136.
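The `count.erase(count.begin())` in the snippet above drops the outermost entry of the workgroup counts for convolution-like ops, i.e. the batch dimension is not distributed across workgroups. A minimal sketch of that behavior (the function name and the bool flag are illustrative, not IREE API):

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper mirroring count.erase(count.begin()): for
// convolution-like ops the outermost (batch) parallel loop is left
// undistributed, so its entry is removed from the workgroup counts.
std::vector<int64_t> dropBatchDim(std::vector<int64_t> counts,
                                  bool isConvLike) {
  if (isConvLike && !counts.empty())
    counts.erase(counts.begin());
  return counts;
}
```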
// As a second step mark all the element-wise linalg ops not fused as roots
// so that they get tiled and distributed.
for (linalg::LinalgOp linalgOp : linalgOps) {
So the only reason we have the MakeDispatchWorkgroupsOp pattern is for the fallback, right? In other words, in the ConvertToGPU pass, the mapping of iterations of a linalg operation to global invocation IDs is not used on the Linalg-on-tensors path. That would be great, because then we could move all that loop distribution logic all the way into Flow.
Correct. There are still a few cases that don't tile; I can look at those at some point. Yes, for all those cases the ConvertToGPU global invocation ID path is not reached. It would indeed be good to do the loop distribution in Flow.
The main remaining case I have seen is when the linalg affine maps are not projected permutations.
Thanks Mahesh, please take another look.
sizes.push_back(subgroupSize);
// Use the first tile size that can divide the shape. If the shape is not
// aligned on any of the tile sizes pick the smallest tile of one element per
// thread.
Please add a TODO here. For the Linalg-on-tensors path this could actually be anything other than 1. It could even be on the old path with a few changes, but why bother when it's going away. For now, leave a TODO and come back to it when we deprecate the old path.
for (linalg::LinalgOp linalgOp : linalgOps) {
  if (auto op = dyn_cast<linalg::GenericOp>(linalgOp.getOperation())) {
    if (getNumOuterParallelLoops(linalgOp) == 0 ||
        llvm::any_of(linalgOp.getIndexingMaps(), [](AffineMap &map) {
It might just be getting mixed up with the old path. This should not actually be needed.
I still need this check; otherwise some linalg ops with unusual affine maps crash during tiling. I can look into why this happens afterwards if you think it is not expected.
Yeah, this is not expected. I don't see why it would crash (at least it would be good to know the affine map). Is this because of reverse? If so, it is worth triaging upstream.
Mahesh, yes, the problem is with reverse; I'll work on fixing this part in a second step.
if (!rootOperation) {
  for (linalg::LinalgOp linalgOp : linalgOps) {
    if (auto op = dyn_cast<linalg::GenericOp>(linalgOp.getOperation())) {
      if (getNumOuterParallelLoops(linalgOp) == 0 ||
When we drop the old path, this will have to go through the new path.
We still need this even in the new path, right?
Sorry, bad comment. I was saying that this basically means "don't do anything with 0-rank operations". But those need to be handled as well (i.e., lowered to loops, albeit as no-ops) at some point. I am guessing this isn't tested on the new path.
One thing to do here is just to test what happens when a linalg.generic has only 0-rank operations.
Good point, I can add a test for that.
I'll merge this PR; if there are still concerns with your comments I can address them afterwards.
This seems to break compiling mobilenet_v2 in iree-android-benchmark. To repro: the mobilenet_v2.mlir can be downloaded from https://storage.googleapis.com/iree-model-artifacts/mobilenet-v2.tar.gz. It fails in ConvertToSPIRVPass; the input IR of ConvertToSPIRVPass is at https://gist.github.com/hanhanW/1f1c5cb70ff1486c04783ea052e2703d. The error message is:
Oh, a bunch of size-1 vectors are generated. Those aren't supported by SPIR-V (they should just be scalars). @hanhanW: is this still exercising the existing path? I think we should switch the benchmark tests to go via the new path. It would also be nice to check in a fake-weights MobileNet v2; discovering breakage via benchmark tests is not that good. :) @ThomasRaoux: can we revert this for now and have the issues fixed before landing it?
Yes, let me revert it and I'll fix it offline.
…iree-org#5159)" This reverts commit 156f0bb.
No description provided.