Addition of fast division (recursive divrem only) #370

czurnieden · 2019-10-11T16:08:42Z

Direct implementation of algorithms 1.8 "RecursiveDivRem" and 1.9 "UnbalancedDivision"
from:

Brent, Richard P., and Paul Zimmermann. "Modern computer arithmetic"
Vol. 18. Cambridge University Press, 2010
Available online

pages 19ff. in the above online document

MasterDuke17 · 2019-10-11T16:26:33Z

Should this change anything about my in-progress implementation of the faster to_radix?

czurnieden · 2019-10-11T16:47:56Z

@MasterDuke17

Should this change anything about my in-progress implementation of the faster to_radix?

No.
I'm just offering it, it may or may not find it's way into LTM eventually.

minad · 2019-10-16T07:51:19Z

I would like to have this 👍

minad

Could you refactor this a bit, introducing a s_mp_div_small function? Then disabling s_mp_div_recursive/school would result in the small function being used.

   if (MP_HAS(S_MP_DIV_RECURSIVE)
       && (b->used > MP_KARATSUBA_MUL_CUTOFF)
       && (b->used <= ((a->used)/3*2))) {
      err = s_mp_div_recursive(a, b, c, d);
   } else if (MP_HAS(S_MP_DIV_SCHOOL)) {
      err = s_mp_div_school(a, b, c, d);
   } else {
      err = s_mp_div_small(a, b, c, d);
   }

It seems BN_MP_DIV_SMALL is the only macro we have which enables an alternative version being built, circumventing the MP_HAS configuration system.

czurnieden · 2019-10-16T22:21:07Z

Could you refactor this a bit, introducing a s_mp_div_small function?

Yepp, no prob'.

It seems BN_MP_DIV_SMALL is the only macro we have which enables an alternative version being built, circumventing the MP_HAS configuration system.

As we are working towards 2.0.0 now (Ignoring the bugfix-versions 1.2.x): we could change it and include it in the MP_HAS configuration system?

bn_s_mp_div_small.c

czurnieden · 2019-10-17T12:26:33Z

The snippet

   if (MP_HAS(S_MP_DIV_RECURSIVE)
       && (b->used > MP_KARATSUBA_MUL_CUTOFF)
       && (b->used <= ((a->used)/3*2))) {
      err = s_mp_div_recursive(a, b, c, d);
   } else if (MP_HAS(S_MP_DIV_SCHOOL)) {
      err = s_mp_div_school(a, b, c, d);
   } else {
      err = s_mp_div_small(a, b, c, d);
   }

Does not seem to work as intended (with LTM_NOTHING), any suggestions besides bracketing them off the old preprocessor way?

tommath_superclass.h

bn_s_mp_div_school.c

bn_s_mp_div_recursive.c

minad · 2019-10-17T13:28:57Z

What do you mean - does not work as intended? If you compile with LTM_NOTHING, only s_mp_div_small will be used as I see it. And yes, this is 2.0 - we can do breaking changes.

czurnieden · 2019-10-17T14:35:07Z

If you compile with LTM_NOTHING, only s_mp_div_small will be used as I see it.

Only if I remove the guards and that will compile s_mp_school and s_mp_recursive, too, which goes against the intent of s_mp_div_small to decrease the size of the actual lib.

The way our configuration system works now results in the following entry in `tommath_class.h

#if defined(BN_MP_DIV_C)
#   define BN_MP_CMP_MAG_C
#   define BN_MP_COPY_C
#   define BN_MP_ZERO_C
#   define BN_S_MP_DIV_RECURSIVE_C
#   define BN_S_MP_DIV_SCHOOL_C
#   define BN_S_MP_DIV_SMALL_C
#endif

So all three are defined and hence all three get compiled if I remove the guards.

If I don't remove the guards, the functions are not included and the linker complains.

If I add extra guards around the branching like e.g.:

#ifndef  BN_S_MP_DIV_SMALL
   if (MP_HAS(S_MP_DIV_RECURSIVE)
       && (b->used > MP_KARATSUBA_MUL_CUTOFF)
       && (b->used <= ((a->used)/3*2))) {
      err = s_mp_div_recursive(a, b, c, d);
   } else {
      err = s_mp_div_school(a, b, c, d);
   }
#else 
      err = s_mp_div_small(a, b, c, d);
#endif

it works but it looks ugly and defies the reason you introduced MP_HAS(x) for: to get rid of all of these preprocessors branches.

Any suggestions?

minad · 2019-10-17T14:39:53Z

Yes, we should rework the configuration system. Optional dependencies (guarded by MP_HAS) should not be required automatically in tommath_class.h. I made such suggestions already in #301. But this is independent of this PR and we can address this later.

czurnieden · 2019-10-17T15:56:45Z

we can address this later.

OK.
So I removed all BN_S_MP_DIV_SMALL guards (including the definition in tommath_superclass.h).

[...] in #301.

It is still on the TODO list, no worry.
I think I'll boldly add the 2.0.0 milestone.

minad · 2019-10-20T16:51:46Z

@czurnieden this PR seems almost ready. Can you rebase it?

czurnieden · 2019-10-20T20:20:07Z

this PR seems almost ready.

Bugfixes not withstanding, of course, but yes.
Changed label accordingly.

demo/test.c

minad · 2019-10-21T05:51:04Z

demo/test.c

@@ -2325,6 +2325,140 @@ static int test_mp_radix_size(void)
   return EXIT_FAILURE;
 }

+#ifndef S_MP_DIV_SMALL


Remove the guard and test both functions test_s_mp_div_recursive, trst_s_mp_div_small. Additional guards are rarely necessary. MP_HAS takes care of it.

minad · 2019-10-21T14:53:57Z

@czurnieden please squash @sjaeckel this looks ready from my side

minad · 2019-10-21T14:57:41Z

I forgot - does it make sense to add an additional cut off here? Instead of using karatsuba?

czurnieden · 2019-10-21T15:26:01Z

" out-of-date with the base branch"?
You are really busy the last couple of days!

I forgot - does it make sense to add an additional cut off here? Instead of using karatsuba?

Found no differences above Karatsuba only below (it needs fast multiplication to function). The larger difference is in the relation numerator/denominator where it stops being faster approaching ~2/3 and even starts to get slower above 0.8.

I don't think that any kind of tuning would make sense.

minad · 2019-10-21T15:35:16Z

Hmm ok, but I mean is there a theoretical reason why the cutoff should be the same? If there is none, I would prefer if you introduce another constant (even if the value is the same).

czurnieden · 2019-10-21T15:49:48Z

Hmm ok, but I mean is there a theoretical reason why the cutoff should be the same?

As I said: it needs fast multiplication to function and the lowest cutoff for that is the Karatsuba cutoff; there is a direct connection, not just coincidence.

We can ignore Comba here because the most likely reason for not wanting the Comba algorithm is lack of memory which means there is no other fast multiplication in that case and I'm pretty sure no space for fast division either.

minad · 2019-10-21T16:04:25Z

Ok!

czurnieden force-pushed the recursive_division branch from 626b5d5 to 37ba298 Compare October 11, 2019 16:14

minad added this to the v2.0.0 milestone Oct 14, 2019

czurnieden force-pushed the recursive_division branch 2 times, most recently from b9d6e5a to ff6759a Compare October 15, 2019 19:15

minad self-requested a review October 16, 2019 07:53

minad requested changes Oct 16, 2019

View reviewed changes

minad added the work in progress label Oct 16, 2019

czurnieden force-pushed the recursive_division branch from ff6759a to 260b037 Compare October 16, 2019 22:58

minad reviewed Oct 17, 2019

View reviewed changes

bn_s_mp_div_small.c Outdated Show resolved Hide resolved

minad reviewed Oct 17, 2019

View reviewed changes

bn_s_mp_div_small.c Outdated Show resolved Hide resolved

minad reviewed Oct 17, 2019

View reviewed changes

tommath_superclass.h Outdated Show resolved Hide resolved

minad requested changes Oct 17, 2019

View reviewed changes

bn_s_mp_div_school.c Outdated Show resolved Hide resolved

bn_s_mp_div_recursive.c Outdated Show resolved Hide resolved

czurnieden force-pushed the recursive_division branch from 8a6ce73 to 929ef57 Compare October 19, 2019 18:44

czurnieden force-pushed the recursive_division branch from 93d6283 to ef00315 Compare October 20, 2019 20:17

czurnieden added finished and removed work in progress labels Oct 20, 2019

minad reviewed Oct 20, 2019

View reviewed changes

demo/test.c Outdated Show resolved Hide resolved

minad reviewed Oct 20, 2019

View reviewed changes

demo/test.c Outdated Show resolved Hide resolved

minad mentioned this pull request Oct 20, 2019

Corrected type for the sign-handling in bit-banger div #392

Closed

minad reviewed Oct 21, 2019

View reviewed changes

minad self-requested a review October 21, 2019 14:53

minad approved these changes Oct 21, 2019

View reviewed changes

czurnieden force-pushed the recursive_division branch from d0c38cf to a3b2386 Compare October 21, 2019 15:24

Addition of fast division (recursive divrem only)

9edd185

czurnieden force-pushed the recursive_division branch from a3b2386 to 9edd185 Compare October 22, 2019 19:02

sjaeckel approved these changes Oct 23, 2019

View reviewed changes

sjaeckel merged commit 1f210d2 into libtom:develop Oct 23, 2019

sjaeckel removed the finished label Oct 23, 2019

fperrad mentioned this pull request Oct 24, 2019

some linting #411

Merged

minad mentioned this pull request Oct 27, 2019

manual: don't mention obsolete MP_DIV_SMALL #421

Merged

sjaeckel mentioned this pull request Nov 21, 2019

change default branch - regression in prime_is_prime #460

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Addition of fast division (recursive divrem only) #370

Addition of fast division (recursive divrem only) #370

czurnieden commented Oct 11, 2019

MasterDuke17 commented Oct 11, 2019

czurnieden commented Oct 11, 2019

minad commented Oct 16, 2019

minad left a comment •

edited

Loading

czurnieden commented Oct 16, 2019

czurnieden commented Oct 17, 2019

minad commented Oct 17, 2019 •

edited

Loading

czurnieden commented Oct 17, 2019

minad commented Oct 17, 2019

czurnieden commented Oct 17, 2019

minad commented Oct 20, 2019

czurnieden commented Oct 20, 2019

minad Oct 21, 2019

minad commented Oct 21, 2019

minad commented Oct 21, 2019

czurnieden commented Oct 21, 2019

minad commented Oct 21, 2019

czurnieden commented Oct 21, 2019

minad commented Oct 21, 2019

Addition of fast division (recursive divrem only) #370

Addition of fast division (recursive divrem only) #370

Conversation

czurnieden commented Oct 11, 2019

MasterDuke17 commented Oct 11, 2019

czurnieden commented Oct 11, 2019

minad commented Oct 16, 2019

minad left a comment • edited Loading

Choose a reason for hiding this comment

czurnieden commented Oct 16, 2019

czurnieden commented Oct 17, 2019

minad commented Oct 17, 2019 • edited Loading

czurnieden commented Oct 17, 2019

minad commented Oct 17, 2019

czurnieden commented Oct 17, 2019

minad commented Oct 20, 2019

czurnieden commented Oct 20, 2019

minad Oct 21, 2019

Choose a reason for hiding this comment

minad commented Oct 21, 2019

minad commented Oct 21, 2019

czurnieden commented Oct 21, 2019

minad commented Oct 21, 2019

czurnieden commented Oct 21, 2019

minad commented Oct 21, 2019

minad left a comment •

edited

Loading

minad commented Oct 17, 2019 •

edited

Loading