[External codegen] Add test cases for fused ops with manual annotation #4741

Closed
masahi wants to merge 18 commits from the partition-fused-ops branch

Conversation

@masahi (Member) commented Jan 19, 2020:

This PR contains:

  • A custom annotator which detects conv + bias add + relu ops.
  • An example of applying FoldScaleAxis and FoldConstant to layers of conv + bn + relu to obtain conv + bias add + relu ops that the annotator can detect before partitioning (see the sketch after this list).
  • DNNL runtime support for the fused conv + bias add + relu op, using DNNL's post_ops feature.
  • Updates to CodegenDNNL that translate fused Relay conv + bias add + relu ops to their DNNL counterparts.
  • Test cases on a simple network and MobileNet that demonstrate the features above.
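
As a rough sketch of the pass sequence described in the second bullet (not the exact code from this PR's tests), the snippet below shows how conv + bn + relu layers might be simplified into conv + bias add + relu before annotation. The helper name simplify_for_annotation is made up, the SimplifyInference pass is an assumed prerequisite for lowering batch_norm (it is not named in the PR description), and the pass namespaces may differ across TVM versions.

    # Sketch only: simplify conv + bn + relu into a conv + bias add + relu
    # chain that the custom annotator can match.
    import tvm
    from tvm import relay

    def simplify_for_annotation(mod):
        # SimplifyInference lowers batch_norm into explicit scale/shift ops,
        # FoldScaleAxis folds the scale into the conv2d weights, and
        # FoldConstant evaluates the leftover constant subexpressions.
        seq = tvm.transform.Sequential([
            relay.transform.SimplifyInference(),
            relay.transform.FoldScaleAxis(),
            relay.transform.FoldConstant(),
        ])
        with tvm.transform.PassContext(opt_level=3):
            return seq(mod)

The annotator and partitioner would then run on the simplified module.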

The result of partitioning mobilenet is dumped here.

Please review, @zhiics @comaniac.

@masahi masahi changed the title [Partitioning] Add test cases for fused ops with manual annotation [External codegen] Add test cases for fused ops with manual annotation Jan 19, 2020
Review comment on this diff hunk:

    };

    Output ret;
    if (auto conv_call = DetectFusedConv2DBiasReLU(call)) {
@zhiics (Member):

I am not sure we really want to handle fused ops from Relay for external codegen. This looks quite ad hoc to me; you may have countless combinations.

@masahi (Member Author) Jan 19, 2020:

The idea is for this to serve as an example of handling fused ops inside external codegen. I assume the DNNL backend itself is not meant to be used in production; its purpose is to be a more realistic example than CodegenC, so I thought we should add an example of how to handle fused ops. I never intended to cover other fusion cases.

Since we are trying to be nice to new backend implementers (who might not be familiar with TVM internals) by adding a convenient op-level annotation and semi-automatic fusion mechanism for them, I don't think it is reasonable to expect them to figure out how to handle more complicated but common cases (like fusion) and everything else on their own. I hope this makes sense.

@masahi (Member Author) Jan 19, 2020:

Another usage scenario which I think is going to be common is translation from quantized Relay models. It would be great to add an example of translating QNN subgraphs to backend implementations; without one, it is not obvious how to go about it.

Since DNNL has quantization support and everyone can use it, it would serve as a good example and test case.

@comaniac (Contributor):

While I agree with you that it's fine to handle fusion in this DNNL codegen, I also agree with @zhiics that the current implementation is a bit too ad hoc, even if it's only used for demo purposes for now. As you have implemented, MKL-DNN uses set_post_ops to attach the ops to be fused. I think this part could be more general. For example:

if call == "relu":
    visit(arg)
    if this->curr_layer == "conv2d":
        generate_post_ops(call)
    else:
        generate_a_layer(call)

In this way, the codegen is able to deal with all MKL-DNN-supported conv2d fusions (conv2d, conv2d+add, conv2d+add+relu). We could still put heuristic pattern annotations in the annotator and improve it gradually; I like the one you made for conv2d+bias+relu in this PR, for instance.

@masahi (Member Author):

Yeah, this is my minimal-effort way to detect only the pattern I care about. I will think about how to make it more general.

@masahi (Member Author):

I can go ahead and implement this, but that would duplicate the pattern-matching logic I already have in my Python annotator. That sounds bad, and it would become a perfect anti-example of the kind mentioned in the RFC below :)

I think I should close this one and wait for a better solution to be ready. I will wait for your input for now, @comaniac @zhiics.

https://discuss.tvm.ai/t/rfc-external-codegen-defining-composite-relay-operators/5470/

@zhiics (Member):

Yeah, I had a brief discussion with @u99127 before. I will read the discussion more carefully, and we can probably discuss from there and try to reach consensus on a design/implementation. Sorry for being late/slow; I am on vacation.

@masahi (Member Author):

I can also leave the current dumb implementation as it is, with the understanding that

  • This is a temporary solution
  • It will serve as a concrete motivation and test case for validating a more general mechanism to be introduced

Trying to be a bit more clever and duplicating an entire state-machine logic here does not seem worth it to me anymore. Either way, I'm fine.

@masahi (Member Author) commented Jan 19, 2020:

@zhiics I'm not trying to make the DNNL backend more feature-complete. I want to add examples and test cases for typical usage scenarios that most backend implementers are likely to encounter.

We talked on the forum about how fusion is already possible with manual annotation, but there is no example that demonstrates it. This PR fills that gap; a rough sketch of what such an annotator looks like follows below.
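
To illustrate the shape of such a manual annotation, here is a minimal sketch of an annotator that wraps a conv2d + add + relu chain in compiler_begin/compiler_end for the "dnnl" compiler. It is a simplified illustration, not the annotator shipped in this PR: the class name and pattern checks are made up for demonstration, and API locations may differ across TVM versions.

    # Hypothetical sketch, not the annotator from this PR: mark a
    # relu(add(conv2d(x, w), b)) chain as one region for the "dnnl" codegen.
    import tvm
    from tvm import relay
    from tvm.relay.expr_functor import ExprMutator
    from tvm.relay.op.annotation import compiler_begin, compiler_end

    class ConvBiasReluAnnotator(ExprMutator):
        def __init__(self, compiler="dnnl"):
            super().__init__()
            self.compiler = compiler

        def _is_op(self, expr, name):
            # True if expr is a call to the Relay operator `name`.
            return (isinstance(expr, relay.Call)
                    and isinstance(expr.op, tvm.ir.Op)
                    and expr.op.name == name)

        def visit_call(self, call):
            if self._is_op(call, "nn.relu") and self._is_op(call.args[0], "add"):
                add = call.args[0]
                conv = add.args[0]
                if self._is_op(conv, "nn.conv2d"):
                    # Annotate the chain's inputs and close it after relu, so the
                    # partitioner lifts the whole chain as one external region.
                    data = compiler_begin(self.visit(conv.args[0]), self.compiler)
                    weight = compiler_begin(self.visit(conv.args[1]), self.compiler)
                    bias = compiler_begin(self.visit(add.args[1]), self.compiler)
                    new_conv = relay.Call(conv.op, [data, weight], conv.attrs)
                    new_add = relay.Call(add.op, [new_conv, bias], add.attrs)
                    return compiler_end(relay.Call(call.op, [new_add]), self.compiler)
            return super().visit_call(call)

Running this mutator over the module's main function and then applying relay.transform.PartitionGraph() would move each annotated chain into an external function handled by the DNNL codegen.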

@masahi (Member Author) commented Jan 19, 2020:

I added a link below where I clarified my intention. Hopefully this clears up some confusion.
https://discuss.tvm.ai/t/solved-external-codegen-how-the-runtime-determines-function-signatures-for-generated-functions/5455/7

@comaniac (Contributor) left a comment:

Thanks for the PR. Overall it looks good to me, just some minor points. Please see the comments for details.

Resolved review comments (outdated) on tests/python/relay/test_pass_partition_graph.py.

@masahi force-pushed the partition-fused-ops branch 3 times, most recently from dd7046b to 3dbce0f on January 24, 2020 at 06:24.
@comaniac (Contributor) commented:
As #4771 has been merged, we can revisit this PR for DNNL fuse patterns.

@masahi (Member Author) commented Feb 10, 2020:

Yes, I want to update this PR, but we don't have a way to hook up the Composite and Compiler attributes yet, so I can't "see" a composite conv + bias + relu in CodegenDNNL at the moment. Refer to the comments below.
#4771 (comment)
#4771 (comment)

@masahi (Member Author) commented Apr 8, 2020:

#5272
