gh-115999: Refactor `LOAD_GLOBAL` specializations to avoid reloading {globals, builtins} keys #124953

mpage · 2024-10-03T21:51:58Z

Each of the LOAD_GLOBAL specializations is implemented roughly as:

Load keys version.
Load cached keys version.
Deopt if (1) and (2) don't match.
Load keys.
Load cached index into keys.
Load object from (4) at offset from (5).

This is not thread-safe in free-threaded builds; the keys object may be replaced in between steps (3) and (4).

This change refactors the specializations to avoid reloading the keys object and instead pass the keys object from guards to be consumed by downstream uops.

Issue: Make the specializing interpreter thread-safe in --disable-gil builds #115999

…uiltins} keys Each of the `LOAD_GLOBAL` specializations is implemented roughly as: 1. Load keys version. 2. Load cached keys version. 3. Deopt if (1) and (2) don't match. 4. Load keys. 5. Load cached index into keys. 6. Load object from (4) at offset from (5). This is not thread-safe in free-threaded builds; the keys object may be replaced in between steps (3) and (4). This change refactors the specializations to avoid reloading the keys object and instead pass the keys object from guards to be consumed by downstream uops.

rruuaanng · 2024-10-05T04:16:08Z

Can you split the change? This makes it difficult for us to review.

Python/bytecodes.c

Python/executor_cases.c.h

Ensure we update the stack to reflect that we've popped the keys. There should be nothing on the stack if we deopt.

Python/optimizer_cases.c.h

Maybe this will stop tickling the msvc compiler bug, too?

colesbury

The x86_64-pc-windows-msvc/msvc (Release) JIT failure is not new.

markshannon

Here's the review I did yesterday, but didn't quite finish in time 🙂

markshannon · 2024-10-09T14:59:56Z

Python/optimizer_analysis.c

                if (incorrect_keys(inst, builtins)) {
                    OPT_STAT_INC(remove_globals_incorrect_keys);
                    return 0;
                }
                if (interp->rare_events.builtin_dict >= _Py_MAX_ALLOWED_BUILTINS_MODIFICATIONS) {
                    continue;
                }
+                if (!check_next_uop(buffer, buffer_size, pc,


We want the optimizer passes to be (as much as possible) simple, fast scans over the uop sequence.
So, I'd like to avoid this sort of non-local check if possible.

Generally we want each pass to be a linear scan which maintains a small set of knowledge, like function_checked, etc. above.
Each case should then either update that knowledge or perform a simple optimization based on that knowledge.

FYI, I plan to merge this pass into optimizer_bytecodes.c which is also a linear pass with similar design principles (at least it should be).

markshannon · 2024-10-09T15:34:22Z

Python/bytecodes.c

@@ -4871,6 +4884,26 @@ dummy_func(
            DEOPT_IF(func->func_version != func_version);
        }

+        tier2 op(_LOAD_GLOBAL_MODULE, (index/1 -- res, null if (oparg & 1))) {


This is effectively pushes the global's keys then does _LOAD_GLOBAL_MODULE_FROM_KEYS.

Maybe add a tier2 op that only pushes the keys? It might make the optimizer simpler as well.

markshannon · 2024-10-09T15:34:31Z

Python/bytecodes.c

+            null = PyStackRef_NULL;
+         }
+
+        tier2 op(_LOAD_GLOBAL_BUILTINS, (index/1 -- res, null if (oparg & 1))) {


markshannon · 2024-10-10T11:26:43Z

No need to revert anything, but I would like to remove _LOAD_GLOBAL_MODULE/_LOAD_GLOBAL_BUILTINS and replace them with _LOAD_GLOBAL_KEYS/_LOAD_BUILTINS_KEYS.

bedevere-app bot mentioned this pull request Oct 3, 2024

Make the specializing interpreter thread-safe in --disable-gil builds #115999

Open

mpage added the skip news label Oct 3, 2024

mpage requested a review from brandtbucher October 3, 2024 22:14

mpage marked this pull request as ready for review October 3, 2024 22:52

mpage requested review from Fidget-Spinner and markshannon as code owners October 3, 2024 22:52

bedevere-app bot added the awaiting review label Oct 3, 2024

mpage requested a review from colesbury October 7, 2024 18:05

colesbury reviewed Oct 7, 2024

View reviewed changes

Python/bytecodes.c Show resolved Hide resolved

Python/executor_cases.c.h Show resolved Hide resolved

mpage added 3 commits October 7, 2024 16:32

Sync stack pointer in ops that consume keys

d00adef

Ensure we update the stack to reflect that we've popped the keys. There should be nothing on the stack if we deopt.

Merge branch 'main' into pythongh-115999-refactor-load-global

e68f7e0

Mark keys dead

7a682cc

mpage requested a review from colesbury October 7, 2024 23:57

colesbury reviewed Oct 8, 2024

View reviewed changes

Python/optimizer_cases.c.h Outdated Show resolved Hide resolved

Fix compiler warning in optimizer_cases.c.h

93f121b

Maybe this will stop tickling the msvc compiler bug, too?

colesbury approved these changes Oct 8, 2024

View reviewed changes

bedevere-app bot added awaiting merge and removed awaiting review labels Oct 8, 2024

Merge branch 'main' into pythongh-115999-refactor-load-global

d41b85a

colesbury enabled auto-merge (squash) October 9, 2024 14:50

colesbury merged commit f978fb4 into python:main Oct 9, 2024
54 of 55 checks passed

bedevere-app bot removed the awaiting merge label Oct 9, 2024

markshannon reviewed Oct 10, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gh-115999: Refactor `LOAD_GLOBAL` specializations to avoid reloading {globals, builtins} keys #124953

gh-115999: Refactor `LOAD_GLOBAL` specializations to avoid reloading {globals, builtins} keys #124953

mpage commented Oct 3, 2024 •

edited by bedevere-app bot

Loading

rruuaanng commented Oct 5, 2024

colesbury left a comment

markshannon left a comment

markshannon Oct 9, 2024

markshannon Oct 9, 2024

markshannon Oct 9, 2024

markshannon commented Oct 10, 2024

gh-115999: Refactor LOAD_GLOBAL specializations to avoid reloading {globals, builtins} keys #124953

gh-115999: Refactor LOAD_GLOBAL specializations to avoid reloading {globals, builtins} keys #124953

Conversation

mpage commented Oct 3, 2024 • edited by bedevere-app bot Loading

rruuaanng commented Oct 5, 2024

colesbury left a comment

Choose a reason for hiding this comment

markshannon left a comment

Choose a reason for hiding this comment

markshannon Oct 9, 2024

Choose a reason for hiding this comment

markshannon Oct 9, 2024

Choose a reason for hiding this comment

markshannon Oct 9, 2024

Choose a reason for hiding this comment

markshannon commented Oct 10, 2024

gh-115999: Refactor `LOAD_GLOBAL` specializations to avoid reloading {globals, builtins} keys #124953

gh-115999: Refactor `LOAD_GLOBAL` specializations to avoid reloading {globals, builtins} keys #124953

mpage commented Oct 3, 2024 •

edited by bedevere-app bot

Loading