Add a fast integer divide that rounds to zero #6455

abadams · 2021-11-30T21:20:35Z

While working on legacy code I discovered a need for this. Performance test shows a good speed-up over native division for vector code:

signed division rounding to zero:
type            const-divisor speed-up  runtime-divisor speed-up
 Int(32,  1)     2.416                   1.153
 Int(16,  1)     2.552                   1.457
 Int( 8,  1)     1.782                   0.667
 Int(32,  8)     8.592                   5.908
 Int(16, 16)    53.008                  38.505
 Int( 8, 32)    19.480                   8.197

dsharletg · 2021-11-30T21:26:33Z

src/FastIntegerDivide.cpp

+        Expr xsign = select(numerator > 0, cast(t, 0), cast(t, -1));
+
+        // Multiply-keep-high-half
+        result = (cast(wide, mul) * numerator);


I think this should use widening_mul intrinsics, because uses of this are after find_intrinsics. Maybe this whole sequence should be mul_shift_right.

Actually this code is only called directly by users, so it's before find_intrinsics. The compiler doesn't ever call this.

Maybe add this as a comment for future readers.

I actually think we should change it to intrinsics anyways. But since the code is just moved and pre-existing, maybe it should be a separate PR.

steven-johnson · 2021-11-30T21:46:38Z

test/performance/const_division.cpp


-        // Reference good version
-        g(x, y) = input(x, y) / cast<T>(y + min_val);
+            // Reference good version


This looks identical to the case just above, are they supposed to be identical?

Yes, they have different schedules which turn the denominator into a constant in one case but not the other.

(I'll add a comment)

steven-johnson · 2021-11-30T21:47:24Z

tools/find_inverse.cpp

+bool srz_method_0(int den, int sh_post, int bits) {
+    int64_t min = -(1L << (bits - 1)), max = (1L << (bits - 1)) - 1;
+    for (int64_t num = min; num <= max; num++) {
+        // for (int iter = 0; iter < 1000000L; iter++) {


Why is this commented out? If it's being left in for (eg) debugging purposes, please say so.

Fixed (deleted)

abadams · 2021-11-30T22:00:11Z

See also related issue #6456

abadams · 2021-12-02T14:07:25Z

review ping

steven-johnson

LGTM

steven-johnson · 2021-12-02T17:41:51Z

src/FastIntegerDivide.cpp

+        Expr xsign = select(numerator > 0, cast(t, 0), cast(t, -1));
+
+        // Multiply-keep-high-half
+        result = (cast(wide, mul) * numerator);


Maybe add this as a comment for future readers.

lordnn · 2022-09-10T21:50:20Z

buf(x) = fast_integer_divide_round_to_zero(select(x % 2 == 0, 5, -5), 2);
result is:
2, -3, 2, -3, 2, -3
Not rounded to zero.

abadams · 2022-09-10T23:05:44Z

Looks like there's a bug in the handling of constant denominators (an early-out path that assumes we're rounding to -infinity). Will fix.

abadams · 2022-09-11T00:15:38Z

See #7008

abadams added 3 commits November 30, 2021 13:17

Add a version of fast_integer_divide that rounds towards zero

aa10a41

clang-format

67f0170

Fix test condition

0c12734

abadams requested a review from dsharletg November 30, 2021 21:20

dsharletg reviewed Nov 30, 2021

View reviewed changes

steven-johnson reviewed Nov 30, 2021

View reviewed changes

abadams added 2 commits November 30, 2021 13:55

Clean up debugging code

914cbdd

Add explanatory comment to performance test

61aabe3

Pacify clang tidy

f215365

steven-johnson approved these changes Dec 2, 2021

View reviewed changes

dsharletg approved these changes Dec 2, 2021

View reviewed changes

abadams merged commit 7992369 into master Dec 7, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a fast integer divide that rounds to zero #6455

Add a fast integer divide that rounds to zero #6455

abadams commented Nov 30, 2021

dsharletg Nov 30, 2021

abadams Nov 30, 2021

steven-johnson Dec 2, 2021

dsharletg Dec 2, 2021

steven-johnson Nov 30, 2021

abadams Nov 30, 2021

abadams Nov 30, 2021

steven-johnson Nov 30, 2021

abadams Nov 30, 2021

abadams commented Nov 30, 2021

abadams commented Dec 2, 2021

steven-johnson left a comment

steven-johnson Dec 2, 2021

lordnn commented Sep 10, 2022

abadams commented Sep 10, 2022

abadams commented Sep 11, 2022

Add a fast integer divide that rounds to zero #6455

Add a fast integer divide that rounds to zero #6455

Conversation

abadams commented Nov 30, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

abadams commented Nov 30, 2021

abadams commented Dec 2, 2021

steven-johnson left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

lordnn commented Sep 10, 2022

abadams commented Sep 10, 2022

abadams commented Sep 11, 2022