Add support for AVX #121

james7132 · 2024-03-21T05:48:45Z

This PR lowers the requirement for 256-bit wide vectors on x86/x86_64 platforms from AVX2 to AVX. #86 mistakenly assumes all of the operations are not available until AVX2, while in reality operations working on __m256d were viable from the start. The main difference is that the code doesn't really use the floating point operations on the type, so the two can be treated the same.

Performance-wise, benchmarks were run and there were zero shown deviations in performance between AVX and AVX2 other than some other incidental speedups in non-set/batch operations.

Used the opportunity to clean up the block directory and deduplicate repeated code and clean up some of the cfg attributes. Also added the new compilation configurations to CI.

james7132 added 6 commits March 20, 2024 22:44

Add support for AVX

74c1651

Formatting

94e772f

Try to fix CI

6e4d588

Fix aarch64

c4dd0cc

Try fixing tests again

dcd96df

Fix AVX

37f2f41

james7132 merged commit 2937449 into petgraph:master Mar 21, 2024
18 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for AVX #121

Add support for AVX #121

james7132 commented Mar 21, 2024 •

edited

Loading

Add support for AVX #121

Add support for AVX #121

Conversation

james7132 commented Mar 21, 2024 • edited Loading

james7132 commented Mar 21, 2024 •

edited

Loading