Support pushing large number of items in tree iterators #12172

martin-fleck-at · 2023-02-10T15:48:17Z

What it does

Provide utility to push in a callstack-safe manner
Add tests

Fixes #12171

How to test

Test cases are included in the PR.

Review checklist

As an author, I have thoroughly tested my changes and carefully followed the review guidelines

Reminder for reviewers

As a reviewer, I agree to behave in accordance with the review guidelines

martin-fleck-at · 2023-02-17T13:12:16Z

I believe the build failure has nothing to do with my change; it seems to happen on all open PRs.

tsmaeder

If I ask myself: "does this PR make the code better/easier to understand/faster", I'm not sure we're going in the right direction. Do we have measurements that support these performance optimisations?

tsmaeder · 2023-02-17T13:57:42Z

packages/core/src/browser/tree/tree-iterator.ts

        stack.push(root);
        while (stack.length > 0) {
            const top = stack.pop()!;
            yield top;
-            stack.push(...(children(top) || []).filter(include).reverse());
+            stack = ArrayUtils.pushAll(stack, (children(top) || []).filter(include).reverse());


For a piece of micro-optimized code, it seems rather strange to do first a copy of the array, then a filter, then a "reverse".

I am also not sure why it was done that way but I didn't want to touch existing functionality without fully understanding the logic behind it.
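For readers wondering about the `filter(...).reverse()` chain discussed above, here is a minimal sketch of what it does in a stack-based traversal. The `DemoNode` type and `depthFirstDemo` function are hypothetical names for illustration only, not the Theia API. Because the stack is last-in-first-out, pushing the children in reversed order means the first child is popped first. `filter` returns a new array, so the in-place `reverse` never mutates the node's own child list.

```typescript
// Hypothetical node shape, used only for this illustration.
interface DemoNode {
    id: string;
    children?: DemoNode[];
}

// Stack-based pre-order traversal, assuming every node is included.
function* depthFirstDemo(root: DemoNode): IterableIterator<DemoNode> {
    const stack: DemoNode[] = [root];
    while (stack.length > 0) {
        const top = stack.pop()!;
        yield top;
        // filter() creates a fresh array, so reverse() leaves top.children untouched.
        const next = (top.children || []).filter(() => true).reverse();
        for (const child of next) {
            stack.push(child);
        }
    }
}

const demoTree: DemoNode = {
    id: 'root',
    children: [
        { id: 'a', children: [{ id: 'a1' }, { id: 'a2' }] },
        { id: 'b' }
    ]
};

const visitedIds: string[] = [];
for (const node of depthFirstDemo(demoTree)) {
    visitedIds.push(node.id);
}
// visitedIds is ['root', 'a', 'a1', 'a2', 'b']
```

Without the `reverse`, the same loop would visit `b` before `a`.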

tsmaeder · 2023-02-17T14:11:06Z

packages/core/src/common/array-utils.ts

+            // typically faster but might fail depending on the number of items and the callstack size
+            array.push(...items);
+            return array;
+        } catch (error) {


I really would advise against this. Did someone actually ever measure the impact of TreeIterator performance on real world performance in Theia? Is our version even better than just doing a recursive version? VMs usually can optimize tail recursion into iteration anyway these days.

What do you mean exactly by our version vs recursive version? In our scenario we had a table with millions of data sets and everything was still working very quickly.

The simplest depth first tree traversal is recursive. We're using a stack here, for some reason. Since I couldn't find any info why we're using a more complicated version of tree traversal here, I feel it may be a case of premature optimization.

I just tested a little bit locally (nothing scientific for sure) against this recursive version:

export function* depthFirst<T>(root: T, ...): IterableIterator<T> {
    yield root;
    const sortedChildren = (children(root) || []).filter(include).reverse();
    for (const child of sortedChildren) {
        yield* depthFirst(child, children);
    }
}

and I'd say with 5.000.000 I saw about 10-15% improvement with the stack-version vs the recursive version. So I think it is useful to keep it like that for now. It also matches the breadth-first search so it is not that hard to read.
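A rough way to reproduce this kind of local comparison is a small timing helper like the one below. This is only a sketch: `timeTraversal` and `countTo` are hypothetical names, `Date.now()` has millisecond resolution at best, and a single run says little about real-world behaviour. It does report absolute times, which is what the discussion above asks for.

```typescript
// Hypothetical helper: drains an iterable and reports how many items
// it produced and how long that took in milliseconds.
function timeTraversal<T>(makeIterator: () => Iterable<T>): { visited: number; millis: number } {
    const start = Date.now();
    let visited = 0;
    for (const _item of makeIterator()) {
        visited++;
    }
    return { visited, millis: Date.now() - start };
}

// Stand-in generator for the demo; a real comparison would pass the
// stack-based and the recursive traversal here instead.
function* countTo(n: number): IterableIterator<number> {
    for (let i = 0; i < n; i++) {
        yield i;
    }
}

const timing = timeTraversal(() => countTo(100000));
console.log(`visited ${timing.visited} items in ${timing.millis} ms`);
```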

martin-fleck-at · 2023-02-17T14:37:22Z

@tsmaeder Thank you for having such a quick look! It's not really a performance optimization and more of a "getting it to work at all" with a large number of items. I did a very quick performance test here: https://jsbench.me/rjle2jyvie. So basically concat was faster than a forEach but I am not sure what you are suggesting.

tsmaeder · 2023-02-17T14:54:59Z

What I'm suggesting is that in real life, it might not really make a difference which is faster. How long are we talking in absolute terms? If a user interaction takes 0.12 instead of 0.08 seconds, it really does not matter and we should simply not optimize for speed. The relative speeds are not really relevant. Do we have to use the spread operator, knowing it might blow the stack?
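Some background on the stack concern: `array.push(...items)` passes every element as a separate call argument, and JavaScript engines cap how many arguments a single call can take. The exact limit depends on the engine and the available stack, so no specific number is assumed here. A loop-based append avoids the problem entirely. The sketch below uses a hypothetical helper name, `pushAllSafe`, and is not the utility from this PR.

```typescript
// Hypothetical helper: appends items one by one, so the number of
// items never turns into the number of arguments of a single call.
function pushAllSafe<T>(target: T[], items: readonly T[]): T[] {
    for (let i = 0; i < items.length; i++) {
        target.push(items[i]);
    }
    return target;
}

// A list large enough that spreading it into push() may fail on some engines.
const manyItems: number[] = Array.from({ length: 500000 }, (_, i) => i);
const targetArray: number[] = [-1];
pushAllSafe(targetArray, manyItems);
// targetArray now holds the original element followed by all 500000 items.
```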

martin-fleck-at · 2023-02-17T15:01:53Z

@tsmaeder Now I get it, sorry for the confusion! I didn't do any particular measurements in this direction yet, but as a user it really didn't matter whether we use the spread operator on smaller lists or use concat on them in the first place. I didn't try the forEach variant in my use case. So in my opinion, the array probably has to be enormous for the user to notice any difference. However, it may also depend on the size of the nodes in the tree?

So we should remove the util again and simply use one of the methods, right? Do you have any preference which method to use?

tsmaeder · 2023-02-17T15:17:27Z

it may also depend on the size of the nodes in the tree

Probably not: the array should reference objects on the heap (pointers)

I would just remove the spread, unless we can observe noticeably worse performance with "concat".

martin-fleck-at · 2023-02-21T11:07:28Z

@tsmaeder Thanks again for having a look. I pushed an update where I simply replace the spread with the concat.
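As a sketch of what "replace the spread with the concat" can look like in a stack-based traversal: the names `WideNode` and `depthFirstConcat` are hypothetical, and the code is an illustration under those assumptions rather than the merged Theia source. `concat` takes the whole array as one argument, so the number of children never becomes an argument count. It returns a new array instead of mutating, which is why `stack` is declared with `let` and reassigned.

```typescript
// Hypothetical node shape for the illustration.
interface WideNode {
    id: number;
    children?: WideNode[];
}

function* depthFirstConcat(
    root: WideNode,
    include: (node: WideNode) => boolean = () => true
): IterableIterator<WideNode> {
    let stack: WideNode[] = [root];
    while (stack.length > 0) {
        const top = stack.pop()!;
        yield top;
        // concat receives one array argument regardless of how many children there are.
        stack = stack.concat((top.children || []).filter(include).reverse());
    }
}

// A root with many direct children; the fan-out is an arbitrary demo value.
const wideRoot: WideNode = {
    id: 0,
    children: Array.from({ length: 10000 }, (_, i) => ({ id: i + 1 }))
};

let visitedCount = 0;
let firstChildId = -1;
for (const node of depthFirstConcat(wideRoot)) {
    if (visitedCount === 1) {
        firstChildId = node.id;
    }
    visitedCount++;
}
// visitedCount is 10001 and firstChildId is 1.
```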

tsmaeder

Looks fine to me now.


martin-fleck-at force-pushed the issues/12171 branch 5 times, most recently from b6951be to 933cb7f Compare February 17, 2023 12:42

martin-fleck-at requested a review from msujew February 17, 2023 13:11

tsmaeder reviewed Feb 17, 2023


martin-fleck-at removed the request for review from msujew February 17, 2023 14:45

martin-fleck-at force-pushed the issues/12171 branch from 933cb7f to 583af17 Compare February 21, 2023 10:50

tsmaeder mentioned this pull request Feb 21, 2023

rpc: maximum call stack size exceeded #12173

Closed

martin-fleck-at force-pushed the issues/12171 branch from 583af17 to 26e0182 Compare February 21, 2023 13:23

martin-fleck-at force-pushed the issues/12171 branch from 26e0182 to e21d396 Compare February 28, 2023 17:00

msujew requested a review from tsmaeder April 17, 2023 14:29

tsmaeder approved these changes Apr 20, 2023


martin-fleck-at force-pushed the issues/12171 branch from e21d396 to 7ae0521 Compare April 20, 2023 14:50

martin-fleck-at added 2 commits April 21, 2023 10:03

Support pushing large number of items in tree iterators

d6e0bde

- Provide utility to push in a callstack-safe manner
- Add tests

Fixes #12171

Simplify by just removing spread operator

a5ca4e4

martin-fleck-at force-pushed the issues/12171 branch from 7ae0521 to a5ca4e4 Compare April 21, 2023 08:03

martin-fleck-at merged commit 2deedba into master Apr 21, 2023

martin-fleck-at deleted the issues/12171 branch April 21, 2023 08:33

github-actions bot added this to the 1.37.0 milestone Apr 21, 2023

This was referenced Nov 27, 2023

[Snyk] Fix for 6 vulnerabilities magnologan/theia#18

Open

[Snyk] Security upgrade @theia/application-package from 1.31.0 to 1.37.0 magnologan/theia#19

Open

[Snyk] Fix for 6 vulnerabilities magnologan/theia#21

Open

This was referenced Dec 21, 2023

[Snyk] Fix for 6 vulnerabilities magnologan/theia#63

Open

[Snyk] Fix for 6 vulnerabilities magnologan/theia#64

Open

[Snyk] Fix for 6 vulnerabilities magnologan/theia#65

Open

[Snyk] Fix for 6 vulnerabilities magnologan/theia#66

Open
