Add conversions for 1x1 conv_2d to matmul #18736
base: main
Conversation
Signed-off-by: Ian Wood <[email protected]>
Can you make the PR description more descriptive when it is ready for review? Here are some examples: https://google.github.io/eng-practices/review/developer/cl-descriptions.html
Can you support the NCHW case as well? I added a suggestion for how to do it.
Additional thought: It may be a reasonable idea to create a linalg.generic op with expanded H and W dims (so one M/N contraction dim for each of the N, H, and W convolution dims). Having fewer reshapes is probably better at GlobalOptimization level, but I am not sure if it is worth it yet. I think the collapse_shape on the H and W dimensions can typically be propagated through the producers/consumers, so it might not make a difference. You may have a better idea of which form is better for dispatch region formation.
Also, this additional thought is not a requirement for this PR, just an idea to try afterwards.
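To make that idea concrete, here is a minimal sketch of a 1x1 conv written as one linalg.generic that keeps N, H, and W as separate parallel dims, so the data tensor needs no collapse_shape. Shapes, SSA names, and unit strides are assumptions, and the filter is taken as already collapsed to CxF:

```mlir
func.func @conv1x1_as_generic(%input: tensor<1x28x28x64xf32>,
                              %filter: tensor<64x128xf32>,
                              %init: tensor<1x28x28x128xf32>)
    -> tensor<1x28x28x128xf32> {
  // One parallel dim per convolution dim (n, h, w, f); c is the only
  // reduction dim, so this is a contraction with no reshapes on %input.
  %r = linalg.generic {
      indexing_maps = [
        affine_map<(n, h, w, f, c) -> (n, h, w, c)>,
        affine_map<(n, h, w, f, c) -> (c, f)>,
        affine_map<(n, h, w, f, c) -> (n, h, w, f)>],
      iterator_types = ["parallel", "parallel", "parallel", "parallel",
                        "reduction"]}
      ins(%input, %filter : tensor<1x28x28x64xf32>, tensor<64x128xf32>)
      outs(%init : tensor<1x28x28x128xf32>) {
    ^bb0(%in: f32, %flt: f32, %acc: f32):
      %0 = arith.mulf %in, %flt : f32
      %1 = arith.addf %0, %acc : f32
      linalg.yield %1 : f32
    } -> tensor<1x28x28x128xf32>
  return %r : tensor<1x28x28x128xf32>
}
```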
compiler/src/iree/compiler/GlobalOptimization/Convert1X1FilterConv2DToMatmul.cpp
Use linalg::generalizeNamedOp to generalize the 1x1 conv and then remove the unit-extent affine symbols from the input's affine map. Additionally, clean up the test cases (since expand/extract is introduced): just check that the affine maps are correct. Signed-off-by: Ian Wood <[email protected]>
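For context, this is roughly what linalg::generalizeNamedOp emits for a 1x1 linalg.conv_2d_nhwc_hwcf before that cleanup (a sketch assuming unit strides and dilations; shapes are illustrative). The window dims d4/d5 are unit-extent, and the commit drops them from the input's indexing map:

```mlir
func.func @generalized_1x1(%input: tensor<1x?x?x64xf32>,
                           %filter: tensor<1x1x64x128xf32>,
                           %init: tensor<1x?x?x128xf32>)
    -> tensor<1x?x?x128xf32> {
  // Generalized conv: d4/d5 iterate over the 1x1 filter window, so they
  // only ever take the value 0 and are unit-extent reduction dims.
  %r = linalg.generic {
      indexing_maps = [
        affine_map<(d0, d1, d2, d3, d4, d5, d6) -> (d0, d1 + d4, d2 + d5, d6)>,
        affine_map<(d0, d1, d2, d3, d4, d5, d6) -> (d4, d5, d6, d3)>,
        affine_map<(d0, d1, d2, d3, d4, d5, d6) -> (d0, d1, d2, d3)>],
      iterator_types = ["parallel", "parallel", "parallel", "parallel",
                        "reduction", "reduction", "reduction"]}
      ins(%input, %filter : tensor<1x?x?x64xf32>, tensor<1x1x64x128xf32>)
      outs(%init : tensor<1x?x?x128xf32>) {
    ^bb0(%in: f32, %flt: f32, %acc: f32):
      %0 = arith.mulf %in, %flt : f32
      %1 = arith.addf %0, %acc : f32
      linalg.yield %1 : f32
    } -> tensor<1x?x?x128xf32>
  return %r : tensor<1x?x?x128xf32>
}
```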
Force-pushed b934b78 to c93d89d
I think we may not even need this pass anymore. Does it still work if you simply add a case for 1x1 filters here?
iree/compiler/src/iree/compiler/GlobalOptimization/GeneralizeLinalgNamedOps.cpp (lines 40 to 48 in c6056d1):
if (isa_and_nonnull<linalg::AbsOp, linalg::AddOp, linalg::BroadcastOp,
                    linalg::CeilOp, linalg::CopyOp, linalg::DivOp,
                    linalg::DivUnsignedOp, linalg::ElemwiseBinaryOp,
                    linalg::ElemwiseUnaryOp, linalg::ExpOp, linalg::FloorOp,
                    linalg::LogOp, linalg::MapOp, linalg::MaxOp,
                    linalg::MulOp, linalg::NegFOp, linalg::ReduceOp,
                    linalg::SubOp, linalg::TransposeOp>(
        linalgOp.getOperation())) {
  namedOpCandidates.push_back(linalgOp);
I would expect generalizing the op would be enough, and hopefully the indexing maps will simplify to contraction maps.
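For illustration, the end state being hoped for here would look something like the following (a sketch assuming unit strides and that the unit-extent kh/kw dims get folded away by a later unit-dim cleanup): the filter picks up a collapse_shape and the generic is left with plain contraction maps.

```mlir
func.func @generalized_then_folded(%input: tensor<1x?x?x64xf32>,
                                   %filter: tensor<1x1x64x128xf32>,
                                   %init: tensor<1x?x?x128xf32>)
    -> tensor<1x?x?x128xf32> {
  // Folding the unit kh/kw dims reduces the filter to CxF...
  %flt = tensor.collapse_shape %filter [[0, 1, 2], [3]]
      : tensor<1x1x64x128xf32> into tensor<64x128xf32>
  // ...and leaves a 5-dim generic whose maps are contraction maps.
  %r = linalg.generic {
      indexing_maps = [
        affine_map<(n, oh, ow, f, c) -> (n, oh, ow, c)>,
        affine_map<(n, oh, ow, f, c) -> (c, f)>,
        affine_map<(n, oh, ow, f, c) -> (n, oh, ow, f)>],
      iterator_types = ["parallel", "parallel", "parallel", "parallel",
                        "reduction"]}
      ins(%input, %flt : tensor<1x?x?x64xf32>, tensor<64x128xf32>)
      outs(%init : tensor<1x?x?x128xf32>) {
    ^bb0(%in: f32, %fval: f32, %acc: f32):
      %0 = arith.mulf %in, %fval : f32
      %1 = arith.addf %0, %acc : f32
      linalg.yield %1 : f32
    } -> tensor<1x?x?x128xf32>
  return %r : tensor<1x?x?x128xf32>
}
```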
Signed-off-by: Ian Wood <[email protected]>
Force-pushed 932c9d2 to fc4d11f
The affine_maps don't get simplified to contraction maps until …
Nice, that's what I was hoping for! As long as the unit dims are getting folded then it should be good.
I think the changes to generalize are fine, but maybe keep the conv -> matmul conversion pass around (it doesn't have to be used in the global optimization pass pipeline).
Convert 1x1 conv_2d to linalg.matmul ops when the HW dimensions are dynamic, and convert linalg.conv_2d_nhwc_hwcf when the N dimension is not 1. No change to linalg.conv_2d_nchw_fchw currently (see the linked issue for discussion). Matmul is simpler and easier for the compiler to understand, allowing for better optimizations.
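As a sketch of the resulting IR for the NHWC case (shapes, SSA names, and the dynamic-shape handling here are illustrative assumptions, not lifted from the pass): the N/H/W dims collapse into the matmul M dim, the 1x1 filter collapses to CxF, and the result is expanded back.

```mlir
func.func @conv1x1_to_matmul(%input: tensor<1x?x?x64xf32>,
                             %filter: tensor<1x1x64x128xf32>,
                             %init: tensor<1x?x?x128xf32>)
    -> tensor<1x?x?x128xf32> {
  %c1 = arith.constant 1 : index
  %c2 = arith.constant 2 : index
  %h = tensor.dim %input, %c1 : tensor<1x?x?x64xf32>
  %w = tensor.dim %input, %c2 : tensor<1x?x?x64xf32>
  // Collapse N, H, W into the matmul M dim; the filter becomes CxF.
  %in = tensor.collapse_shape %input [[0, 1, 2], [3]]
      : tensor<1x?x?x64xf32> into tensor<?x64xf32>
  %flt = tensor.collapse_shape %filter [[0, 1, 2], [3]]
      : tensor<1x1x64x128xf32> into tensor<64x128xf32>
  %out = tensor.collapse_shape %init [[0, 1, 2], [3]]
      : tensor<1x?x?x128xf32> into tensor<?x128xf32>
  %mm = linalg.matmul ins(%in, %flt : tensor<?x64xf32>, tensor<64x128xf32>)
      outs(%out : tensor<?x128xf32>) -> tensor<?x128xf32>
  // Expand back to NHWC (tensor.expand_shape's output_shape form,
  // needed for the dynamic dims in recent MLIR).
  %res = tensor.expand_shape %mm [[0, 1, 2], [3]]
      output_shape [1, %h, %w, 128]
      : tensor<?x128xf32> into tensor<1x?x?x128xf32>
  return %res : tensor<1x?x?x128xf32>
}
```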