chore: Remove USE_ALIGNED_ACCESS and enhance BYTE_ORDER handling #2456

mapleFU · 2024-07-31T10:30:21Z

Redis uses lot of USE_ALIGNED_ACCESS as the "fastpath" for ARM like archtecture, however, I think modern compiler can handle this kind of optimization well. So this part of code is removed.

Besides, some vendored libraray uses BYTE_ORDER macro, this might not being defined in these files. So I use BYTE_ORDER instead

PragmaTwice · 2024-07-31T10:58:15Z

src/types/redis_bitmap.cc

-            }
-          }
-        }
-#endif


I'm wondering if the performance difference between this fast path and vanilla algorithm is large.

Personally I think https://github.com/apache/arrow/blob/main/cpp/src/arrow/util/bitmap_ops.h#L160-L242 would be better, I'll do a benchmark there

I've checked the current impl, and I guess the fast path would be faster😅

FYI: https://godbolt.org/z/dnK6xbzvj
@PragmaTwice @git-hulk

You can use https://quick-bench.com/ which can generate a chart.

I revert the part because the underlying code is too slow 😅

Besides, I change to use memcpy to generalize it. I'll try a benchmark later

apache/arrow@76cebfa
Lucky, arrow has similiar testing here. On My MacOS M1Pro with Release -O2:

BenchmarkBitmapVisitBitsetAnd/32768/0 753392 ns 749634 ns 937 bytes_per_second=41.687M/s BenchmarkBitmapVisitBitsetAnd/131072/0 2986097 ns 2985449 ns 234 bytes_per_second=41.8698M/s BenchmarkBitmapVisitBitsetAnd/32768/1 746267 ns 746040 ns 939 bytes_per_second=41.8878M/s BenchmarkBitmapVisitBitsetAnd/131072/1 2991597 ns 2990679 ns 234 bytes_per_second=41.7965M/s BenchmarkBitmapVisitBitsetAnd/32768/2 747519 ns 747314 ns 940 bytes_per_second=41.8164M/s BenchmarkBitmapVisitBitsetAnd/131072/2 2985102 ns 2984500 ns 234 bytes_per_second=41.8831M/s

The code has no different from bit-hacking and

src/vendor/murmurhash2.h

mapleFU · 2024-08-03T15:16:31Z

@git-hulk @PragmaTwice I've paste the result #2456 (comment)

https://github.com/apache/arrow/blob/45b176716cc667384577a2a1218c6da454854109/cpp/src/arrow/util/bit_util_benchmark.cc#L165-L189

The code runs same speed with highly optimized code in macos, and x86 would share this optimization

sonarcloud · 2024-08-03T16:48:36Z

Quality Gate passed

Issues
4 New issues
0 Accepted issues

Measures
0 Security Hotspots
59.6% Coverage on New Code
1.0% Duplication on New Code

See analysis details on SonarCloud

mapleFU · 2024-08-07T12:19:57Z

@PragmaTwice would you mind check again?

tweak port parts

23f0d13

mapleFU requested a review from PragmaTwice July 31, 2024 10:30

PragmaTwice reviewed Jul 31, 2024

View reviewed changes

git-hulk reviewed Jul 31, 2024

View reviewed changes

src/vendor/murmurhash2.h Show resolved Hide resolved

git-hulk previously approved these changes Jul 31, 2024

View reviewed changes

git-hulk and others added 3 commits July 31, 2024 20:40

Merge branch 'unstable' into tweak-port-parts

abc0fed

build(cmake): update compiler version requirement (apache#2455)

6cb3985

Revert back bitmap_op fast path

ad5d8d2

mapleFU dismissed git-hulk’s stale review via ad5d8d2 August 1, 2024 05:16

mapleFU added 2 commits August 1, 2024 13:16

Merge branch 'unstable' into tweak-port-parts

dfabf28

minor

edd1c63

mapleFU requested a review from PragmaTwice August 1, 2024 05:17

mapleFU added 3 commits August 1, 2024 13:58

fixup

28130ce

fix distance compute

521a070

Merge branch 'unstable' into tweak-port-parts

b72679d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: Remove USE_ALIGNED_ACCESS and enhance BYTE_ORDER handling #2456

chore: Remove USE_ALIGNED_ACCESS and enhance BYTE_ORDER handling #2456

mapleFU commented Jul 31, 2024

PragmaTwice Jul 31, 2024

mapleFU Aug 1, 2024

mapleFU Aug 1, 2024

mapleFU Aug 1, 2024

PragmaTwice Aug 1, 2024

mapleFU Aug 1, 2024

mapleFU Aug 3, 2024 •

edited

Loading

mapleFU commented Aug 3, 2024

sonarcloud bot commented Aug 3, 2024

mapleFU commented Aug 7, 2024

chore: Remove USE_ALIGNED_ACCESS and enhance BYTE_ORDER handling #2456

Are you sure you want to change the base?

chore: Remove USE_ALIGNED_ACCESS and enhance BYTE_ORDER handling #2456

Conversation

mapleFU commented Jul 31, 2024

PragmaTwice Jul 31, 2024

Choose a reason for hiding this comment

mapleFU Aug 1, 2024

Choose a reason for hiding this comment

mapleFU Aug 1, 2024

Choose a reason for hiding this comment

mapleFU Aug 1, 2024

Choose a reason for hiding this comment

PragmaTwice Aug 1, 2024

Choose a reason for hiding this comment

mapleFU Aug 1, 2024

Choose a reason for hiding this comment

mapleFU Aug 3, 2024 • edited Loading

Choose a reason for hiding this comment

mapleFU commented Aug 3, 2024

sonarcloud bot commented Aug 3, 2024

Quality Gate passed

mapleFU commented Aug 7, 2024

mapleFU Aug 3, 2024 •

edited

Loading