
gh-85160: improve performance of singledispatchmethod #107148

Merged: 32 commits into python:main, Aug 6, 2023

Conversation

@eendebakpt (Contributor) commented Jul 23, 2023:

This PR implements the idea from #85160 to improve performance of the singledispatchmethod by caching the generated dispatch method. It is a continuation of #23213

Also see #106448
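
For context, here is a minimal sketch of the caching idea (an illustration, not the code merged by this PR): the descriptor builds the dispatching wrapper once per instance and memoizes it in a WeakKeyDictionary, so repeated attribute accesses reuse it instead of rebuilding it.

# Minimal sketch of the caching idea; names and details are illustrative,
# not the exact implementation merged by this PR.
from functools import singledispatch, update_wrapper
from weakref import WeakKeyDictionary

class cached_singledispatchmethod:
    """Single-dispatch method descriptor that caches the generated
    dispatch wrapper per instance."""

    def __init__(self, func):
        self.func = func
        self.dispatcher = singledispatch(func)
        self._method_cache = WeakKeyDictionary()

    def register(self, cls, method=None):
        return self.dispatcher.register(cls, method)

    def __get__(self, obj, cls=None):
        if obj is not None:
            try:
                # Fast path: reuse the wrapper built on a previous access.
                return self._method_cache[obj]
            except (KeyError, TypeError):
                pass  # not cached yet, or obj cannot be weakly referenced

        def _method(*args, **kwargs):
            method = self.dispatcher.dispatch(args[0].__class__)
            return method.__get__(obj, cls)(*args, **kwargs)

        update_wrapper(_method, self.func)
        if obj is not None:
            try:
                self._method_cache[obj] = _method
            except TypeError:
                pass  # e.g. instances of slotted classes without __weakref__
        return _method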

eendebakpt and others added 2 commits on July 23, 2023 (co-authored by Alex Waygood).
@corona10 (Member) left a comment:

LGTM with a nit comment.

It's become a lot faster:

Benchmark script:

import pyperf

runner = pyperf.Runner()
runner.timeit(name="bench singledispatchmethod",
              stmt="""
_ = t.go(1, 1)
""",
setup="""
from functools import singledispatch, singledispatchmethod

class Test:
    @singledispatchmethod
    def go(self, item, arg):
        pass

    @go.register
    def _(self, item: int, arg):
        return item + arg

t = Test()
""")

Mean +- std dev: [base] 1.37 us +- 0.01 us -> [opt] 410 ns +- 2 ns: 3.35x faster
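
(For reference, comparison lines like the one above are typically produced by running the pyperf script twice, e.g. python bench.py -o base.json on the base interpreter and python bench.py -o opt.json on the patched build, then python -m pyperf compare_to base.json opt.json; the exact invocation used here is not shown.)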

@eendebakpt (Contributor, Author) commented:

@AlexWaygood The weakref approach can be improved a bit. We can use _all_weakrefable_instances instead of _unweakrefable_instances, which avoids a negation in the check. We can also use _all_weakrefable_instances to avoid the expensive lookups in the WeakKeyDictionary in the case of slotted classes.

That's very nice indeed. Good spot! I can reproduce your results on my machine.

Could you change your PR to this approach? I think I agree with @ethanhs that mutating the instance dictionary of instances with singledispatchmethod is too risky a behaviour change here :/

The PR was adapted. Since cls is not None is true for all common cases (methods, staticmethods and classmethods), I swapped the order of the _all_weakrefable_instances and cls is not None checks.
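
Applied to the sketch above, a __get__ fast path guarded by such a flag might look roughly like this (illustrative only; the exact presence and ordering of the None checks was tuned during review, and the merged code may differ):

# Sketch: a boolean flag, initialised to True in __init__, guards the
# WeakKeyDictionary lookup, so instances that cannot be weakly referenced
# only pay for the failing lookup once per descriptor.
def __get__(self, obj, cls=None):
    if self._all_weakrefable_instances and cls is not None and obj is not None:
        try:
            return self._method_cache[obj]
        except TypeError:
            # e.g. an instance of a slotted class without __weakref__:
            # stop consulting the cache for this descriptor.
            self._all_weakrefable_instances = False
        except KeyError:
            pass
    # ... fall through to building (and, when possible, caching) the
    # dispatch wrapper as in the first sketch ...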

@AlexWaygood (Member) commented Jul 31, 2023:

I did some more playing around locally and found we could improve performance even more with a few tweaks. Proposed a PR to your branch at eendebakpt#5, which also adds some more test coverage.

@eendebakpt (Contributor, Author) commented Jul 31, 2023:

@AlexWaygood I merged your PR and added one more optimization: we can skip the obj is not None check since for obj=None the weakref indexing will fail with a TypeError.

One more optimization is in https://github.com/eendebakpt/cpython/pull/6/files. It eliminates the caching variable. The assumption there is that if self._method_cache[obj] raises a KeyError, the assignment self._method_cache[obj] = ... is safe.

I still have to do benchmarking on this.
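
Putting both tweaks into the running sketch (again illustrative, not necessarily the merged code): the explicit None test disappears because the WeakKeyDictionary lookup raises TypeError for obj=None anyway, and the KeyError branch simply falls through to building and storing the wrapper. Note that in this simplified form a class-level access also trips the TypeError branch and switches caching off.

# Sketch combining the two tweaks described above.
def __get__(self, obj, cls=None):
    if self._all_weakrefable_instances:
        try:
            # Raises TypeError both for obj=None (access via the class)
            # and for instances that cannot be weakly referenced.
            return self._method_cache[obj]
        except TypeError:
            self._all_weakrefable_instances = False
        except KeyError:
            # The lookup itself worked, so storing into the cache below
            # is assumed to be safe as well.
            pass

    def _method(*args, **kwargs):
        method = self.dispatcher.dispatch(args[0].__class__)
        return method.__get__(obj, cls)(*args, **kwargs)

    update_wrapper(_method, self.func)
    if self._all_weakrefable_instances:
        self._method_cache[obj] = _method
    return _method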

@AlexWaygood (Member) left a comment:

LGTM. @ethanhs, how does this look to you now?

@eendebakpt (Contributor, Author) commented:

Latest benchmarks:

bench singledispatchmethod: Mean +- std dev: [main] 2.48 us +- 0.02 us -> [pr] 1.04 us +- 0.02 us: 2.38x faster
bench classmethod: Mean +- std dev: [main] 3.45 us +- 0.06 us -> [pr] 3.50 us +- 0.07 us: 1.01x slower
bench classmethod on instance: Mean +- std dev: [main] 3.49 us +- 0.06 us -> [pr] 1.12 us +- 0.01 us: 3.11x faster
bench slotted class: Mean +- std dev: [main] 2.48 us +- 0.03 us -> [pr] 2.53 us +- 0.03 us: 1.02x slower

Geometric mean: 1.64x faster

So the performance improves for the common case (normal methods) and is more or less equal for class methods and slotted classes.

One more variation is to eliminate attribute lookups in the dispatch wrapper (sketched after the numbers below): eendebakpt/cpython@singledispatchmethod3...eendebakpt:cpython:singledispatchmethod3b
This results in (compared to this PR):

bench singledispatchmethod: Mean +- std dev: [pr] 1.04 us +- 0.02 us -> [pr_lookup] 979 ns +- 15 ns: 1.06x faster
bench classmethod on instance: Mean +- std dev: [pr] 1.12 us +- 0.01 us -> [pr_lookup] 1.06 us +- 0.01 us: 1.06x faster
bench slotted class: Mean +- std dev: [pr] 2.53 us +- 0.03 us -> [pr_lookup] 2.52 us +- 0.03 us: 1.01x faster

Benchmark hidden because not significant (1): bench classmethod

Geometric mean: 1.03x faster
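
The attribute-lookup elimination referred to above boils down to binding the dispatch function to a local inside __get__ before defining the wrapper, so each call avoids re-resolving self.dispatcher.dispatch (a sketch, using the variable name suggested in the review comment below):

# Sketch: hoist the attribute lookups out of the per-call wrapper.
dispatch = self.dispatcher.dispatch

def _method(*args, **kwargs):
    # One fast local-name load per call instead of walking
    # self -> dispatcher -> dispatch on every invocation.
    return dispatch(args[0].__class__).__get__(obj, cls)(*args, **kwargs)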

@AlexWaygood (Member) commented Aug 1, 2023:

One more variation is to eliminate attribute lookups in the dispatch wrapper: eendebakpt/cpython@singledispatchmethod3...eendebakpt:cpython:singledispatchmethod3b

I can reproduce the speedup locally (great spot!), and it seems like a reasonable thing to do. Please name the variable dispatch rather than dp, though :)

One other optimisation that shaves around 5 microseconds off the benchmark for slotted classes and other objects that can't be weakref'd (but doesn't do much for those that can):

--- a/Lib/functools.py
+++ b/Lib/functools.py
@@ -959,7 +959,7 @@ def _method(*args, **kwargs):
             method = self.dispatcher.dispatch(args[0].__class__)
             return method.__get__(obj, cls)(*args, **kwargs)

-        _method.__isabstractmethod__ = self.__isabstractmethod__
+        _method.__isabstractmethod__ = getattr(self.func, "__isabstractmethod__", False)

But if you want to do ^that, you should probably add a comment about why we're duplicating the logic in the __isabstractmethod__ property.

@AlexWaygood (Member) commented:

One other optimisation that shaves around 5 microseconds off the benchmark for slotted classes and other objects that can't be weakref'd (but doesn't do much for those that can):

--- a/Lib/functools.py
+++ b/Lib/functools.py
@@ -959,7 +959,7 @@ def _method(*args, **kwargs):
             method = self.dispatcher.dispatch(args[0].__class__)
             return method.__get__(obj, cls)(*args, **kwargs)

-        _method.__isabstractmethod__ = self.__isabstractmethod__
+        _method.__isabstractmethod__ = getattr(self.func, "__isabstractmethod__", False)

Hmm, I thought I could see a speedup from this earlier, but now I can no longer reproduce it. I assume it was only noise -- best to leave it.

@AlexWaygood merged commit 3e334ae into python:main on Aug 6, 2023 (17 checks passed).
@AlexWaygood (Member) commented:

@mental32 and @eendebakpt, thanks so much for all your work on this!

try:
    _method = self._method_cache[obj]
except TypeError:
    self._all_weakrefable_instances = False
Member commented:

Hmm, I wonder if this would have been a good idea, to keep memory usage down for the slow path (since in the slow path we don't really need the WeakKeyDictionary at all).

Suggested change:
-            self._all_weakrefable_instances = False
+            self._all_weakrefable_instances = False
+            del self._method_cache

@corona10, what do you think? Does it matter?

@eendebakpt (Contributor, Author) replied:

One could even delay the creation of the WeakKeyDictionary until the first invocation of __get__. The code would look a bit odd, though; a simple del looks cleaner.
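
For illustration, the lazily created cache variant mentioned here might look something like this (a hypothetical sketch; attribute names follow the earlier sketches), which is why a plain del in the TypeError branch reads more cleanly:

# Hypothetical sketch of creating the WeakKeyDictionary only on first use;
# __init__ would set self._method_cache = None instead of allocating it.
def __get__(self, obj, cls=None):
    if self._all_weakrefable_instances:
        if self._method_cache is None:
            self._method_cache = WeakKeyDictionary()   # allocated lazily
        try:
            return self._method_cache[obj]
        except TypeError:
            self._all_weakrefable_instances = False
            self._method_cache = None                  # release the unneeded dict
        except KeyError:
            pass
    # ... build and possibly cache the dispatch wrapper as before ...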

Member replied:

Yeah, I think I prefer the approach with del
