[Quant][PT2E] Enable X86InductorQuantizer single quantizable op(maxpool2d) #105639

leslie-fang-intel · 2023-07-20T04:26:08Z

Stack from ghstack (oldest at bottom):

Summary
This PR enables two things:

- Enable the skeleton of the quantization recipe for single quantizable operators in `X86InductorQuantizer`.
- Add a quantization recipe for `maxpool2d` and annotate it so that its input and output share one observer.
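For readers unfamiliar with the "shared observer" idea: max pooling only selects values that already exist in its input, so the same scale and zero point can describe both sides. The following is a minimal sketch in plain Python, not the PyTorch quantization API. The helpers `quantize` and `max_pool_1d` and the chosen scale and zero point are hypothetical and serve only to illustrate that quantizing before or after pooling gives the same integers when both sides use the same parameters.

```python
# Toy illustration only: plain Python, not the PyTorch quantization API.

def quantize(x, scale, zero_point, qmin=0, qmax=255):
    """Affine-quantize one float to an unsigned 8-bit integer."""
    q = round(x / scale) + zero_point
    return max(qmin, min(qmax, q))


def max_pool_1d(values, kernel):
    """Non-overlapping 1-D max pooling (stride equals kernel size)."""
    return [max(values[i:i + kernel])
            for i in range(0, len(values) - kernel + 1, kernel)]


# Arbitrary parameters, as if produced by a single shared observer.
scale, zero_point = 0.05, 128
x = [-1.3, 0.2, 0.75, -0.4, 2.1, 1.9, -3.0, 0.0]

pool_then_quantize = [quantize(v, scale, zero_point) for v in max_pool_1d(x, 2)]
quantize_then_pool = max_pool_1d([quantize(v, scale, zero_point) for v in x], 2)

# Quantization is monotonic (non-decreasing), so it commutes with max().
assert pool_then_quantize == quantize_then_pool
```

Because the two orders agree, a single set of quantization parameters is enough for the input and the output of the pooling op, which is what sharing an observer expresses.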

Test Plan

python -m pytest test_x86inductor_quantizer.py -k test_maxpool2d_recipe

pytorch-bot · 2023-07-20T04:26:10Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/105639


✅ You can merge normally! (1 Unrelated Failure)

As of commit 2cd113c with merge base 97a291f:

UNSTABLE - The following job failed but was likely due to flakiness present on trunk and has been marked as unstable:

linux-focal-rocm5.6-py3.8 / test (default, 1, 3, linux.rocm.gpu, unstable)

This comment was automatically generated by Dr. CI and updates every 15 minutes.


jerryzh168 · 2023-08-21T18:51:59Z

torch/ao/quantization/quantizer/x86_inductor_quantizer.py

@@ -28,14 +29,83 @@
    get_source_partitions,
    SourcePartition,
 )
-from .quantizer import QuantizationAnnotation, QuantizationSpec, Quantizer
+from .quantizer import (


nit: probably better to import from "torch.ao.quantization.quantizer"

leslie-fang-intel (Aug 21, 2023): Thanks for the comment; changed.


jerryzh168 · 2023-08-23T05:53:50Z

torch/ao/quantization/quantizer/x86_inductor_quantizer.py

+                    list(maxpool_node.users)[0].target == operator.getitem
+                )
+                getitem_node = list(node.users)[0]
+                if not _is_all_annotated([getitem_node, maxpool_node]):


You probably want to skip if any of the nodes is annotated? Otherwise you will be breaking some existing annotations.

leslie-fang-intel (Aug 23, 2023): We add this check because we expect `getitem_node` and `maxpool_node` to have already been annotated in the previous step, `_annotation_propagation_quantizable_pattern`, which only annotates the inputs of these two nodes. Here we then annotate the output of `getitem_node`.

jerryzh168 (Aug 23, 2023): I see. I feel this is error-prone, since these nodes might be annotated for other reasons. I think the more robust way to annotate things would be to have a self-contained, meaningful pattern and annotate just that pattern by itself.

leslie-fang-intel (Aug 24, 2023): Makes sense. We can discuss it further.
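The concern in this thread can be illustrated with a toy model. This is only a sketch under stated assumptions: `ToyNode`, `is_all_annotated`, `is_any_annotated`, and `annotate_pattern` are hypothetical stand-ins written for this example. The real `_is_all_annotated` helper appears in the diff above, but its implementation is not shown here. The sketch contrasts a check that passes whenever every node is already annotated (by anyone) with a self-contained step that claims a pattern only if none of its nodes has been touched.

```python
from dataclasses import dataclass, field


@dataclass
class ToyNode:
    """Toy stand-in for a graph node (hypothetical; not torch.fx.Node)."""
    name: str
    meta: dict = field(default_factory=dict)


def is_annotated(node):
    return bool(node.meta.get("annotated_by"))


def is_all_annotated(nodes):
    return all(is_annotated(n) for n in nodes)


def is_any_annotated(nodes):
    return any(is_annotated(n) for n in nodes)


def annotate_pattern(nodes, recipe):
    """Claim a whole pattern, but only if no node in it is already annotated."""
    if is_any_annotated(nodes):
        return False
    for n in nodes:
        n.meta["annotated_by"] = recipe
    return True


# Case 1: another recipe already annotated both nodes for its own reasons.
maxpool = ToyNode("max_pool2d", {"annotated_by": "other_recipe"})
getitem = ToyNode("getitem", {"annotated_by": "other_recipe"})
passes_all_check = is_all_annotated([getitem, maxpool])      # True
claimed_existing = annotate_pattern([getitem, maxpool], "maxpool_recipe")  # False

# Case 2: a fresh, untouched pattern is annotated in one self-contained step.
fresh = [ToyNode("getitem"), ToyNode("max_pool2d")]
claimed_fresh = annotate_pattern(fresh, "maxpool_recipe")    # True
```

In case 1 the "all annotated" check passes even though the annotations came from elsewhere, which is the situation the reviewer describes as error-prone; the "any annotated" guard leaves the existing annotations untouched.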

jerryzh168

accepting to unblock


leslie-fang-intel · 2023-08-26T08:32:14Z

@pytorchbot merge

pytorchmergebot · 2023-08-26T08:34:04Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).


**Summary**
Enable the `dq-maxpool2d-q` pattern match and lower into `torch.ops.quantized.max_pool2d`.

**Test Plan**
```
python -m pytest test_mkldnn_pattern_matcher.py -k test_qmaxpool2d
python -m pytest test_quantized_op.py -k test_max_pool2d_pt2e
```

Pull Request resolved: #105906
Approved by: https://github.com/jgong5, https://github.com/eellison
ghstack dependencies: #104580, #104581, #104588, #104590, #105455, #105456, #105639
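As a rough illustration of what matching a `dq-maxpool2d-q` pattern and lowering it to a single quantized op means, here is a toy sketch. It assumes a flat list of op names, whereas the real Inductor pattern matcher works on an FX graph; the function `fuse_dq_maxpool_q` and the op-name strings are hypothetical and chosen only for this example.

```python
def fuse_dq_maxpool_q(ops):
    """Replace each consecutive dequantize -> max_pool2d -> quantize run
    with a single fused quantized op (toy model on a flat op list)."""
    pattern = ["dequantize", "max_pool2d", "quantize"]
    fused, i = [], 0
    while i < len(ops):
        if ops[i:i + 3] == pattern:
            fused.append("quantized.max_pool2d")
            i += 3  # skip the three ops that were just fused
        else:
            fused.append(ops[i])
            i += 1
    return fused


graph = ["quantize", "dequantize", "max_pool2d", "quantize", "dequantize", "conv2d"]
lowered = fuse_dq_maxpool_q(graph)
print(lowered)
# ['quantize', 'quantized.max_pool2d', 'dequantize', 'conv2d']
```

The fused op consumes and produces quantized values directly, so the surrounding dequantize and quantize steps disappear from the rewritten sequence.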

**Summary**
After the oneDNN 3.1 upgrade, we no longer need to calculate the weight scale reciprocal. This PR removes the redundant reciprocal calculation to optimize QConv performance, using the IDeep version API to implement it:
- This QConv implementation is expected to work functionally both with the current IDeep version and with the following IDeep upgrade in PR #107565.
- With the following IDeep upgrade in PR #107565, QConv has better performance since the redundant reciprocal calculation is removed.

Pull Request resolved: #105996
Approved by: https://github.com/jgong5, https://github.com/jerryzh168
ghstack dependencies: #104580, #104581, #104588, #104590, #105455, #105456, #105639, #105906

…ht scale reciprocal calculation (#107565)

**Summary**
Upgrade IDeep, which includes one IDeep change, IDeep PR intel/ideep#226. That PR does two things:
- Remove the redundant QConv weight scale reciprocal calculation.
- Bump `IDEEP_VERSION_REVISION` from 0 to 1.

So only QConv-related calculation will be impacted, and we already use the IDeep version API in #105996 to make the corresponding change in PyTorch.

Pull Request resolved: #107565
Approved by: https://github.com/jgong5, https://github.com/jerryzh168
ghstack dependencies: #104580, #104581, #104588, #104590, #105455, #105456, #105639, #105906, #105996

…ol2d) (#105639)

**Summary**
In this PR, we mainly enable 2 things.
- Enable the skeleton of the quantization recipe for single quantizable operators in `X86InductorQuantizer`.
- Add a quantization recipe for `maxpool2d` and annotate it so that its input and output share one observer.

**Test Plan**
```
python -m pytest test_x86inductor_quantizer.py -k test_maxpool2d_recipe
```

Pull Request resolved: #105639
Approved by: https://github.com/jgong5, https://github.com/jerryzh168
ghstack dependencies: #104580, #104581, #104588, #104590, #105455, #105456


leslie-fang-intel requested a review from jerryzh168 as a code owner July 20, 2023 04:26

pytorch-bot bot added the release notes: quantization release notes category label Jul 20, 2023

[Quant][PT2E] Enable X86InductorQuantizer single quantizable op(maxpo…

feb1de2

…ol2d) [ghstack-poisoned]

leslie-fang-intel requested a review from jgong5 July 20, 2023 04:29

leslie-fang-intel marked this pull request as draft July 20, 2023 04:30

pytorchbot added the open source label Jul 20, 2023

leslie-fang-intel added a commit that referenced this pull request Jul 20, 2023

[Quant][PT2E] Enable X86InductorQuantizer single quantizable op(maxpo…

886beab

…ol2d) ghstack-source-id: a1331e5412b957705ec32a869490e60ec894ef2f Pull Request resolved: #105639

leslie-fang-intel added the ciflow/trunk Trigger trunk jobs on your pull request label Jul 20, 2023

jgong5 requested changes Jul 20, 2023

leslie-fang-intel added a commit that referenced this pull request Jul 24, 2023

[Quant][PT2E] Enable X86InductorQuantizer single quantizable op(maxpo…

23aabe3

…ol2d) ghstack-source-id: 0ace32d1cf1434f9c55e3bee6f18922242a14a4d Pull Request resolved: #105639

leslie-fang-intel added a commit that referenced this pull request Jul 24, 2023

[Quant][PT2E] Enable X86InductorQuantizer single quantizable op(maxpo…

977c5b2

…ol2d) ghstack-source-id: d482e5dcaf6bb2537b7733f4debffc1136e532f2 Pull Request resolved: #105639

leslie-fang-intel requested a review from jgong5 July 24, 2023 09:57

jerryzh168 reviewed Aug 21, 2023

leslie-fang-intel requested a review from jerryzh168 August 21, 2023 23:24

leslie-fang-intel mentioned this pull request Aug 22, 2023

Test IDeep Upgrade for ARM in IDeep #107676

Closed

jerryzh168 reviewed Aug 23, 2023

leslie-fang-intel requested a review from jerryzh168 August 23, 2023 06:23

jerryzh168 approved these changes Aug 23, 2023

leslie-fang-intel mentioned this pull request Aug 25, 2023

[Quant][PT2E]Make _fuse_conv_bn_ support graph capture by torch._dynamo.export #107951

Closed

pytorchmergebot added the merging label Aug 26, 2023

pytorchmergebot added Merged and removed merging labels Aug 26, 2023

pytorchmergebot closed this in 70ca18f Aug 26, 2023

facebook-github-bot deleted the gh/leslie-fang-intel/61/head branch August 29, 2023 14:17
