perf(linter): use binary_search instead of contains #4446

togami2864 · 2024-11-01T13:55:36Z

Summary

Replace all .contains with .binsearch.

Test Plan

Added tests to ensure values are sorted.

codspeed-hq · 2024-11-01T14:38:56Z

CodSpeed Performance Report

Merging #4446 will not alter performance

_{Comparing togami2864:perf/binsearch (9a55fc8) with main (f38694c)}

Summary

✅ 99 untouched benchmarks

arendjr

Can we enforce this at the type-level somehow? The obvious solution would be to use a BtreeSet instead. I’m a bit afraid someone would add an entry and overlook the fact they need to be ordered, and we’d have a bug.

Conaclos · 2024-11-02T10:43:45Z

Can we enforce this at the type-level somehow? The obvious solution would be to use a BtreeSet instead. I’m a bit afraid someone would add an entry and overlook the fact they need to be ordered, and we’d have a bug.

We could add unit tests as we did for the sorted arrays of JS builtins.

arendjr · 2024-11-02T10:55:53Z

Adding tests would be another approach indeed, but do we know the benefit of this approach to begin with? If there are no benchmarks, we're just complicating trivial functionality for unclear gain.

It could even very well be that we're making things slower with this approach: https://www.reddit.com/r/rust/comments/1anlbui/comment/kpxl77q/

Conaclos · 2024-11-02T11:45:33Z

It could even very well be that we're making things slower with this approach: https://www.reddit.com/r/rust/comments/1anlbui/comment/kpxl77q/

Yes, indeed, For small arrays, linear search is always the fastest approach. It is unclear to me what "small" is. The link you shared suggests 100 items. However, this also depends on the complexity of the comparison function that is really cheap for integers and a bit more complex for strings.
Moreover, Rust 1.82 (released in October 2024) introduced a rewrite of the binary search implementation (See the associated changelog entry and PR). It should now be faster than in the previous versions. Notably, when LLVM is able to determine the slice length (that is likely to happen in the current use case), it generates a compact branch-less code. Thus, I am unsure whether it makes a real difference of using one or the other in our use case.
Another approach is using a perfect hash functions. If I remember correctly it is also slower than a linear search for small arrays of strings.

I tend to use linear search for very small arrays (8 items or fewer). We could evaluate if the number should be higher. For quite-small arrays binary search should be good enough.

By the way, if we write tests to check order we could use the recently stabilized is_sorted method.

togami2864 self-assigned this Nov 1, 2024

github-actions bot added A-Linter Area: linter L-CSS Language: CSS labels Nov 1, 2024

arendjr reviewed Nov 1, 2024

View reviewed changes

togami2864 added 4 commits November 12, 2024 00:12

chore: use binary_search instead of contains

26925b0

chore: add test cases

f3189b7

fix: order

d53c8df

chore: use is_sorted()

9a55fc8

togami2864 force-pushed the perf/binsearch branch from 0f26644 to 9a55fc8 Compare November 12, 2024 01:15

togami2864 marked this pull request as ready for review November 14, 2024 11:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(linter): use binary_search instead of contains #4446

perf(linter): use binary_search instead of contains #4446

togami2864 commented Nov 1, 2024 •

edited

Loading

codspeed-hq bot commented Nov 1, 2024 •

edited

Loading

arendjr left a comment

Conaclos commented Nov 2, 2024

arendjr commented Nov 2, 2024 •

edited

Loading

Conaclos commented Nov 2, 2024 •

edited

Loading

perf(linter): use binary_search instead of contains #4446

Are you sure you want to change the base?

perf(linter): use binary_search instead of contains #4446

Conversation

togami2864 commented Nov 1, 2024 • edited Loading

Summary

Test Plan

codspeed-hq bot commented Nov 1, 2024 • edited Loading

CodSpeed Performance Report

Merging #4446 will not alter performance

Summary

arendjr left a comment

Choose a reason for hiding this comment

Conaclos commented Nov 2, 2024

arendjr commented Nov 2, 2024 • edited Loading

Conaclos commented Nov 2, 2024 • edited Loading

togami2864 commented Nov 1, 2024 •

edited

Loading

codspeed-hq bot commented Nov 1, 2024 •

edited

Loading

arendjr commented Nov 2, 2024 •

edited

Loading

Conaclos commented Nov 2, 2024 •

edited

Loading