gh-117139: A StackRef Debugging Mode #121134

Fidget-Spinner · 2024-06-28T16:03:52Z

This enforces HPy-like semantics in CPython's eval loop. It's checked at runtime with a new handle table. This catches errors like double decref, introducing dead references to the stack, etc.

See https://github.com/faster-cpython/ideas/blob/2beefd4da2bf956c837e8fb7097b7a43c53f5735/3.14/stackref_semantics.md
for more info.

I set up a single runner that just runs python -m test test_asyncio as this mode is 2.5x slower than normal CPython. That is usually enough to catch almost all bugs. Besides, this mode is also not fully compatible with all our tests that introspect CPython.

This PR depends on #121127 being merged first.

Issue: Set up tagged pointers in the evaluation stack #117139

markshannon · 2024-06-28T17:41:58Z

I don't think you need the live flag in the stack ref. The presence of the bits in the mapping implies liveness.
That way the _PyStackRef struct can remain unchanged.

Something like this:

THE_MAP: dict[uintptr_t, object] = {}
THE_COUNTER = 0

class StackRef:
    bits: uintptr_t

    def __init__(self, obj):
        THE_COUNTER += 1
        self.bits = THE_COUNTER
        THE_MAP[self.bits] = obj
       
    def close(self):
        del THE_MAP[self.bits]

    def dup(ref: StackRef) -> StackRef:
        obj = THE_MAP[ref.bits]
        return StackRef(obj)

    def steal_obj(ref: StackRef) -> object:
        return THE_MAP.pop(ref.bits)
      
    def steal_ref(ref: StackRef) -> StackRef:
        return StackRef.from_obj_steal(self.steal_obj())
  
    @classmethod
    def from_obj_steal(obj: object) -> StackRef:
        res = StackRef(obj)
        Py_DECREF(object)
        return res

    @classmethod
    def from_obj_new(obj: object) -> StackRef:
        return StackRef(obj)

The hashtable in hashtable.c should provide the basics of the mapping.

markshannon

How much extra effort would it be do this for the default build?

Can you try this on the full set of tests, just to see if it would be acceptable on CI?

markshannon · 2024-07-17T08:12:09Z

.github/workflows/build.yml

@@ -551,6 +564,7 @@ jobs:
    - check_generated_files
    - build_macos
    - build_macos_free_threading
+    - build_macos_free_threading_with_stackref_debug


Could we do this for normal builds as well?
The linux machines are cheapest, so we should probably use those.

We can't because normal builds require all the inline functions to be macros #121263. And these are too complicated to fit into a single macro.

markshannon · 2024-07-17T08:12:14Z

.github/workflows/reusable-macos.yml

          --prefix=/opt/python-dev \
          --with-openssl="$(brew --prefix [email protected])"
    - name: Build CPython
      run: make -j8
    - name: Display build info
      run: make pythoninfo
    - name: Tests
-      run: make test
+      # Stackref debug is 3.5x slower than normal CPython,


How about using the set of tests for profiling? It is about 40 tests instead of the usual ~400, so would give decent coverage without being too slow.

Sorry how do I run those specific tests? What command with python -m test?

Include/internal/pycore_stackref.h

Fidget-Spinner · 2024-07-17T14:04:23Z

Can you try this on the full set of tests, just to see if it would be acceptable on CI?

I did. It fails

    test.test_gdb.test_pretty_print test_capi test_exceptions
    test_regrtest test_repl test_sys test_tracemalloc

mainly because they count allocations/test out of memory situations, and the hashtable adds random allocations which breaks their careful calculations.

On my computer on free threaded builds with the stackref debugging mode, it takes 5 minutes to run the whole test suite. macOS is the fastest/"cheapest" runner we have right now because Cirrus Labs is sponsoring bare-metal runners IIRC. Without stackref debug mode, it takes 2 min 7 s. So roughly a 2.5x slowdown. Factoring that in, the fastest Cirruslabs runners takes 6 min to complete on this PR. So it should take 15 minutes to run the whole test suite. We would need to skip those tests above too, as those are known failures.

ericsnowcurrently · 2024-07-18T16:46:35Z

mainly because they count allocations/test out of memory situations

That's too bad. I wonder what it would take to fix those tests to be less fragile (tightly coupled to fine details)? I've opened gh-121978 so we can consider that separately.

Fidget-Spinner added 2 commits June 28, 2024 23:59

A StackRef Debugging Mode

ddf839e

Create 2024-06-29-00-02-42.gh-issue-117139.-lKXPB.rst

0b0fa3d

Fidget-Spinner requested review from ericsnowcurrently, markshannon, erlend-aasland, corona10, ezio-melotti and hugovk as code owners June 28, 2024 16:03

bedevere-app bot mentioned this pull request Jun 28, 2024

Set up tagged pointers in the evaluation stack #117139

Closed

bedevere-app bot added the awaiting core review label Jun 28, 2024

fix CI

d2c7bb9

Fidget-Spinner added the topic-free-threading label Jun 28, 2024

Fixup defines

2c6860a

Fidget-Spinner added 5 commits June 30, 2024 16:54

Use a hashtable instead

ba49e58

fix unused warning

18d26ce

fix build

5ff2a57

Fix build again

e2563c6

Merge remote-tracking branch 'upstream/main' into stackref_debug_real

fe687c3

markshannon reviewed Jul 17, 2024

View reviewed changes

Fidget-Spinner added 2 commits July 17, 2024 18:03

Merge remote-tracking branch 'upstream/main' into stackref_debug_real

a6e6576

cleanup

a058c34

ericsnowcurrently mentioned this pull request Jul 18, 2024

Make Memory-Related Tests Less Fragile #121978

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gh-117139: A StackRef Debugging Mode #121134

gh-117139: A StackRef Debugging Mode #121134

Fidget-Spinner commented Jun 28, 2024 •

edited

Loading

markshannon commented Jun 28, 2024 •

edited

Loading

markshannon left a comment

markshannon Jul 17, 2024

Fidget-Spinner Jul 17, 2024

markshannon Jul 17, 2024

Fidget-Spinner Jul 17, 2024

Fidget-Spinner commented Jul 17, 2024

ericsnowcurrently commented Jul 18, 2024

gh-117139: A StackRef Debugging Mode #121134

Are you sure you want to change the base?

gh-117139: A StackRef Debugging Mode #121134

Conversation

Fidget-Spinner commented Jun 28, 2024 • edited Loading

markshannon commented Jun 28, 2024 • edited Loading

markshannon left a comment

Choose a reason for hiding this comment

markshannon Jul 17, 2024

Choose a reason for hiding this comment

Fidget-Spinner Jul 17, 2024

Choose a reason for hiding this comment

markshannon Jul 17, 2024

Choose a reason for hiding this comment

Fidget-Spinner Jul 17, 2024

Choose a reason for hiding this comment

Fidget-Spinner commented Jul 17, 2024

ericsnowcurrently commented Jul 18, 2024

Fidget-Spinner commented Jun 28, 2024 •

edited

Loading

markshannon commented Jun 28, 2024 •

edited

Loading