Special condensed packing for point data #33

mourner · 2020-12-10T20:25:48Z

I wonder if it's possible to use half the memory specifically for point data, where we know that maxX and maxY duplicate minX and minY. This could look like an option in the constructor (onlyPoints = false), plus special-case handling for indexed leaves.


bjornharrtell · 2020-12-20T20:42:47Z

This has also crossed my mind a few times. :)

kylebarron · 2023-10-09T22:59:56Z

Is there still interest in this? If it would be considered, I'd be happy to attempt a PR. My general thoughts would be:

- Have a flag on the class for whether the index is point-only or not.
- Change the computed nodesByteSize to be 2-per-node for point data (flatbush/index.js, line 68 in 4ab68d7):
  const nodesByteSize = numNodes * 4 * this.ArrayType.BYTES_PER_ELEMENT;
- Add an addPoint method that takes only x and y as arguments.
- Update add to throw if this.onlyPoints is true (or maybe only throw if minX !== maxX or minY !== maxY).
- Update finish to ensure correct sorting for point data. (I know... but seems doable at a glance and left to explore fully in a pr)
- Bump the serialized format version number (flatbush/index.js, line 4 in 4ab68d7):
  const VERSION = 3; // serialized format version
  We might also have to allocate an extra byte in the header so that there's a way to tell whether the given buffer only contains points or not (or, if we assume the buffer is always valid input, I suppose you could check whether the length of the buffer matches what you'd expect with either 2-item or 4-item boxes?).
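As a rough illustration of the addPoint and add ideas in the list above, here is a minimal sketch. The class name `PointCollector` and its fields are hypothetical and are not part of flatbush; it only shows the argument handling, not the tree building.

```javascript
// Hypothetical sketch, not flatbush's API: collects point coordinates
// using two values per item instead of four.
class PointCollector {
    constructor(numItems) {
        // Assumption: Float64Array storage, 2 coordinates per point.
        this.coords = new Float64Array(numItems * 2);
        this.pos = 0;
    }

    // Adds a point and returns its zero-based index.
    addPoint(x, y) {
        const index = this.pos >> 1;
        this.coords[this.pos++] = x;
        this.coords[this.pos++] = y;
        return index;
    }

    // Accepts a box only if it is degenerate (a point); throws otherwise.
    add(minX, minY, maxX, maxY) {
        if (minX !== maxX || minY !== maxY) {
            throw new Error('Point-only index: expected minX === maxX and minY === maxY.');
        }
        return this.addPoint(minX, minY);
    }
}
```

Keeping add as a guarded alias would let existing callers that already pass zero-size boxes keep working, at the cost of one extra comparison per item.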

kylebarron · 2023-10-10T15:39:51Z

> (I know... but seems doable at a glance and left to explore fully in a pr)

I had some time on a flight, tried to implement this, and learned why drawing the rest of the owl is non-trivial 😉. It took me embarrassingly long to realize that only the leaf nodes contain a single x and y, because all intermediate nodes contain more than one point 🙈.
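To put a number on how much a leaf-only saving could be, here is a small sketch. The helper names `countNodes` and `nodesByteSize` are hypothetical, and the level-counting loop is an assumption about how a packed tree with a fixed node size is laid out, not a copy of the library's code.

```javascript
// Hypothetical helper: total number of nodes (leaves plus parents)
// in a packed tree with `nodeSize` children per parent.
function countNodes(numItems, nodeSize) {
    let n = numItems;
    let numNodes = n;
    do {
        n = Math.ceil(n / nodeSize);
        numNodes += n;
    } while (n !== 1);
    return numNodes;
}

// Hypothetical helper: bytes needed for the box data.
// Assumption: with onlyPoints, leaves store 2 values (x, y) while
// parent nodes still store 4 (minX, minY, maxX, maxY).
function nodesByteSize(numItems, nodeSize, bytesPerElement, onlyPoints) {
    const numNodes = countNodes(numItems, nodeSize);
    if (!onlyPoints) return numNodes * 4 * bytesPerElement;
    const numParents = numNodes - numItems;
    return (numItems * 2 + numParents * 4) * bytesPerElement;
}
```

For 1000 items, a node size of 16, and 8-byte coordinates, this gives 34176 bytes for boxes and 18176 bytes for points, so the saving is close to, but not exactly, half.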

So it seems like the complexity is mostly in the structure of the boxes and the use of << 2 to navigate the tree. We wouldn't be able to use << 2 directly anymore, because the leaf boxes at the beginning of the array would take up only half the expected length.

It would increase complexity and might cause a small performance regression for the non-point case, but it seems like we could work around this with a subtraction offset computed from the number of items in the tree and the tree level.
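One way the subtraction offset could look, as a sketch only: the function `boxOffset` is hypothetical, and it assumes nodes are numbered consecutively with all leaves first, leaves taking 2 values and parents taking 4.

```javascript
// Hypothetical sketch: position of a node's first coordinate in the
// coordinate array.
// Assumption: node indices 0..numItems-1 are leaves, the rest are parents.
function boxOffset(nodeIndex, numItems, onlyPoints) {
    // Uniform layout: every node takes 4 values.
    if (!onlyPoints) return nodeIndex << 2;
    // Compact leaves: 2 values each.
    if (nodeIndex < numItems) return nodeIndex << 1;
    // Parents: 4 values each, shifted back by the 2 values saved per leaf.
    return (nodeIndex << 2) - (numItems << 1);
}
```

With 5 items, the last leaf (index 4) starts at position 8 and the first parent (index 5) starts at position 10, compared with 16 and 20 in the uniform layout. The extra branch on every lookup is the kind of overhead the non-point case would have to absorb.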

This is an interesting problem, and it would be really awesome to have a single tree structure that works equally well for points and non-points, so I think I'll still try to find a solution in the near future.

mourner · 2023-10-10T16:31:55Z

@kylebarron awesome, thanks for working on this! I attempted this in the past but ran into the same issue: it demanded a significant increase in complexity and branching, not just in indexing but in the search methods too, and at some point it seemed that it would be easier to just fork the project with a separate point-based version rather than implement it all in one class. But if you find an elegant way to incorporate this, I'd be happy to review.

mourner added the enhancement New feature or request label Dec 10, 2020

jbuckmccready mentioned this issue Feb 16, 2021

neighbors support? jbuckmccready/static_aabb2d_index#1

Closed

mootari mentioned this issue Apr 28, 2022

Default add() arguments to zero-width rects? #42

Closed

paddymul mentioned this issue May 27, 2023

extract the 4 offset to a constant #48

Open
