Store only the IDs needed for Query iteration #12476

james7132 · 2024-03-14T09:12:52Z

Objective

Other than the exposed functions for reading matched tables and archetypes, a QueryState does not actually need both internal Vecs for storing matched archetypes and tables. In practice, it will only use one of the two depending on if it uses dense or archetypal iteration.

Same vein as #12474. The goal is to reduce the memory overhead of using queries, which Bevy itself, ecosystem plugins, and end users are already fairly liberally using.

Solution

Add StorageId, which is a union over TableId and ArchetypeId, and store only one of the two at runtime. Read the slice as if it was one ID depending on whether the query is dense or not.

This follows in the same vein as #5085; however, this one directly impacts heap memory usage at runtime, while #5085 primarily targeted transient pointers that might not actually exist at runtime.

Changelog

Changed: QueryState::matched_tables now returns an iterator instead of a reference to a slice.
Changed: QueryState::matched_archetypes now returns an iterator instead of a reference to a slice.

Migration Guide

QueryState::matched_tables and QueryState::matched_archetypes does not return a reference to a slice, but an iterator instead. You may need to use iterator combinators or collect them into a Vec to use it as a slice.

crates/bevy_ecs/src/query/state.rs

alice-i-cecile

Okay, I can see the motivation here. I don't think this is a meaningful regression in terms of readability, and saving memory here is a useful application.

To confirm my understanding:

Whether or not a query uses dense or sparse iteration can be determined at constant time, based on whether or not both the data and filter are dense (aka don't contain any sparse-set components).
We can reuse the same space to store the table (dense) and archetype (sparse) IDs, since we always care about exactly one of the two.
We don't want to use an enum here, because the discriminant would take an extra byte, which we don't need since we can always just reconstruct this based on the constants already stored in our QueryState.

james7132 · 2024-03-17T19:32:53Z

We don't want to use an enum here, because the discriminant would take an extra byte, which we don't need since we can always just reconstruct this based on the constants already stored in our QueryState.

It's actually an extra 4 bytes due to padding, which isn't a huge deal, but the branch on something that should be an invariant is potentially worse in hot loops. It could be an enum and then use DebugCheckedUnwrap, but that might negatively impact readability, may add some optimization blockers to the compiler if widespread enough, and it would always negate any memory savings from this PR.

Co-authored-by: Alice Cecile <[email protected]>

cBournhonesque

I read through this code recently to try to finally understand ECS internals, so i challenged myself to review this

cBournhonesque · 2024-03-21T17:46:22Z

crates/bevy_ecs/src/query/state.rs

            }
            let table_index = archetype.table_id().as_usize();
            if !self.matched_tables.contains(table_index) {
                self.matched_tables.grow_and_insert(table_index);
-                self.matched_table_ids.push(archetype.table_id());
+                if D::IS_DENSE && F::IS_DENSE {


Do we still also need separate self.matched_tables and self.matched_archetypes bitsets?
It looks like only one of the two will be used, depending on the value of D::IS_DENSE and F::IS_DENSE

Maybe it's still needed for things like join?

Yep exactly, join needs them, and get_unchecked_manual needs specifically matched_archetypes (right now).

cBournhonesque · 2024-03-21T17:50:49Z

crates/bevy_ecs/src/query/state.rs

@@ -21,6 +21,17 @@ use super::{
    QuerySingleError, ROQueryItem,
 };

+/// An ID for either a table or an archetype. Used for Query iteration.


I don't know if it would be useful to specify that this is used for optimizing query iteration; in the case where all components are in tables, we can iterate through table_ids directly instead of archetypes

I would also maybe add a comment on why this being a union is ok here, as unions are rarer than enums. It's because we know which variant to use depending on QueryState::IS_DENSE so we can skip storing the variant type?

cBournhonesque · 2024-03-21T17:51:28Z

crates/bevy_ecs/src/query/iter.rs

@@ -650,8 +648,7 @@ impl<'w, 's, D: ReadOnlyQueryData, F: QueryFilter, const K: usize> FusedIterator
 }

 struct QueryIterationCursor<'w, 's, D: QueryData, F: QueryFilter> {
-    table_id_iter: std::slice::Iter<'s, TableId>,
-    archetype_id_iter: std::slice::Iter<'s, ArchetypeId>,
+    storage_id_iter: std::slice::Iter<'s, StorageId>,


Does that mean that the table_entities and archetype_entities below could have a similar optiimization as storage_entities: &'w [StorageEntities]?

This is what #5085 does, however the benefit may be limited as this struct doesn't really exist at runtime: it's never formally materialized to the stack or the heap under normal use cases and any inlined iteration will decompose the fetches and updates.

Compare this with the Vec in QueryState, which requires both stack and heap space due to being a persisted heap allocated backing for Query. There's real memory savings by using the union.

Doable but I'm not sure if the savings are worth the introduction of more unsafe and readability impact.

crates/bevy_ecs/src/query/state.rs

james7132 added 2 commits March 13, 2024 23:50

Remove unnecessary matched ID storage

b35c3d1

Adjust the if checks when updating archetypes

edcc766

james7132 added A-ECS Entities, components, systems, and events C-Performance A change motivated by improving speed, memory usage or compile times M-Needs-Migration-Guide A breaking change to Bevy's public API that needs to be noted in a migration guide labels Mar 14, 2024

Formatting

057adf3

MrGVSV reviewed Mar 14, 2024

View reviewed changes

crates/bevy_ecs/src/query/state.rs Outdated Show resolved Hide resolved

Cleanup code and docs

7881113

james7132 mentioned this pull request Mar 15, 2024

Use dense fetches in Query::get and Query::iter_many #12501

Open

james7132 added 3 commits March 15, 2024 16:29

Formatting

89c77fd

Fix docs

8e8b82b

Merge branch 'main' into union-query-ids

190af72

james7132 marked this pull request as ready for review March 15, 2024 23:36

Fix soundness issue when joining queries

f24f77f

alice-i-cecile reviewed Mar 17, 2024

View reviewed changes

crates/bevy_ecs/src/query/state.rs Outdated Show resolved Hide resolved

alice-i-cecile approved these changes Mar 17, 2024

View reviewed changes

james7132 and others added 2 commits March 17, 2024 12:34

pub(crate) defensively

69f267b

Co-authored-by: Alice Cecile <[email protected]>

Merge branch 'main' into union-query-ids

5010e6f

james7132 requested a review from MrGVSV March 18, 2024 03:33

james7132 added 3 commits March 18, 2024 00:11

Restore matched_tables and matched_archetypes

ad484ea

Restore Debug implementation

caf91fd

Fix CI and improve performance of bitset joins

c7e5e8e

cBournhonesque reviewed Mar 21, 2024

View reviewed changes

Document more about StorageIds

f77497b

james7132 requested a review from cBournhonesque March 23, 2024 02:14

cBournhonesque approved these changes Mar 23, 2024

View reviewed changes

james7132 added the S-Ready-For-Final-Review This PR has been approved by the community. It's ready for a maintainer to consider merging it label Mar 23, 2024

james7132 added 2 commits March 22, 2024 23:19

Foramtting

361deca

Cleanup parallel iteration code

16fd523

james7132 added 2 commits March 23, 2024 01:57

Fix docs

1893268

Formatting

3cfebae

alice-i-cecile reviewed Mar 25, 2024

View reviewed changes

crates/bevy_ecs/src/query/state.rs Outdated Show resolved Hide resolved

Backticks for CI

c0f0835

alice-i-cecile enabled auto-merge March 25, 2024 18:40

alice-i-cecile added this pull request to the merge queue Mar 25, 2024

alice-i-cecile removed this pull request from the merge queue due to a manual request Mar 25, 2024

james7132 enabled auto-merge March 29, 2024 06:41

james7132 added this pull request to the merge queue Mar 30, 2024

Merged via the queue into bevyengine:main with commit 286bc8c Mar 30, 2024
26 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Store only the IDs needed for Query iteration #12476

Store only the IDs needed for Query iteration #12476

james7132 commented Mar 14, 2024 •

edited

Loading

alice-i-cecile left a comment

james7132 commented Mar 17, 2024 •

edited

Loading

cBournhonesque left a comment

cBournhonesque Mar 21, 2024

james7132 Mar 21, 2024

cBournhonesque Mar 21, 2024

cBournhonesque Mar 21, 2024

cBournhonesque Mar 21, 2024

james7132 Mar 21, 2024

Store only the IDs needed for Query iteration #12476

Store only the IDs needed for Query iteration #12476

Conversation

james7132 commented Mar 14, 2024 • edited Loading

Objective

Solution

Changelog

Migration Guide

alice-i-cecile left a comment

Choose a reason for hiding this comment

james7132 commented Mar 17, 2024 • edited Loading

cBournhonesque left a comment

Choose a reason for hiding this comment

cBournhonesque Mar 21, 2024

Choose a reason for hiding this comment

james7132 Mar 21, 2024

Choose a reason for hiding this comment

cBournhonesque Mar 21, 2024

Choose a reason for hiding this comment

cBournhonesque Mar 21, 2024

Choose a reason for hiding this comment

cBournhonesque Mar 21, 2024

Choose a reason for hiding this comment

james7132 Mar 21, 2024

Choose a reason for hiding this comment

james7132 commented Mar 14, 2024 •

edited

Loading

james7132 commented Mar 17, 2024 •

edited

Loading