Remove `weight` method and introduce `sequential-threshold` #81

nikomatsakis · 2016-08-05T10:02:37Z

As I wrote in #49: "Currently parallel iterators assume cheap operations. I am thinking that should be changed to assume expensive operations (i.e., fine-grained parallel splits), and have people opt-in to the current behavior by manually adjusting the weights or calling weight_min."

This branch implements that idea. The weight method is deprecated and a new sequential_threshold(N) method is added. Calling with a value N means that, if Rayon has less than N items remaining, it will not attempt to spawn another thread. So if you set the threshold to 222, and you had 400 items total, then Rayon would first split into threads, each processing 200 items, and then stop. (The docs point out that this is not a guarantee, however, and your code cannot rely on this for correctness.)

I still think this is the right thing -- but I was surprised by how drastically it affected some fine-grained benchmarks, which really suffered unless the threshold is added in. (I expected them to go slow, but not as slow as they did.) This is probably just highlighting optimization that needs to be done.

I'm curious to hear what people think. cc @cuviper, who always has good advice, and @dirvine, who participated on #49. =)

Fixes #49

cuviper · 2016-08-14T23:19:16Z

In general, I like it. The weight was a fairly abstract concept, but this should be easier to understand how the code will execute, since you're almost directly controlling that.

My only hesitation is that this threshold can't be stacked the same way weights were multiplied. That means the caller has to understand their whole chain in one place. Maybe that's fine.

In the recommendation for setting this right before the final action, I think it might be better if this were actively enforced. I'm imagining moving those final actions into a new ParallelFinalize trait, with the general requirement of ParallellIterator: ParallelFinalize, but then Threshold would only implement the latter. This should also simplify a lot of the other types so they don't have to reason about base thresholds at all -- it's only either an implicit 1 or directly overridden from Threshold.

nikomatsakis · 2016-08-15T15:26:57Z

My only hesitation is that this threshold can't be stacked the same way weights were multiplied. That means the caller has to understand their whole chain in one place. Maybe that's fine.

Yeah, that's awkward. I liked how in theory if you had something that yielded like a fn foo() -> impl ParallelIterator, it could (internally) apply some weights, that would compose nicely.

Is there a way to make the "threshold" compose better? It is sort of counterintuitive to me that it doesn't work...

nikomatsakis · 2016-08-15T15:27:38Z

In the recommendation for setting this right before the final action, I think it might be better if this were actively enforced.

Yeah I guess this might be better, though I'd prefer to find something more composable.

I still think "heavy" is the right default though.

nikomatsakis · 2016-08-15T15:28:30Z

OTOH what I really want is to make schedule more adaptive and/or efficient so that it adjusts weights automatically. =) But I'm nervous about this, I don't know of any project that truly claims "success" in this area.

iqualfragile · 2016-09-14T10:24:44Z

Maybe the weight problem could be helped by providing some constants, called for example cheap, medium heavy, that would make people tag their tasks correctly, and you could still give more precise values.

nikomatsakis · 2016-09-15T16:48:25Z

I like the idea of a simplified "cheap vs default vs expensive" API. I will play with that.

FWIW, I tried playing around with some simple heuristics for "auto-thresholding". In particular, I experiment with, after a split, checking when you go to do the RHS if a steal has occurred. If no steal, then we would forego further splitting. This definitely affected performance, but generally made things worse. I have to do more experimentation but as I said I think for short term at least (if not forever) having some hints from user has to be helpful.

cuviper · 2016-09-15T16:55:24Z

FWIW, I was also playing with the split-after-steal idea last night, making sure to split at least NCPUS times before running locally. I found it made some of our poorly weighted benchmarks behave a lot better, but it did worse on some that were previously well tuned. So I think there may be something here for a default unweighted case, but it needs to trust user input too. I'll play more and share code if I get something I think is worthwhile...

nikomatsakis · 2016-10-14T13:12:40Z

After having let this sit for a bit, I think i've come to a few conclusions:

changing the default still feels right;
but I'd like to simplify the interface to something like weight(CHEAP | EXPENSIVE | DEFAULT), where DEFAULT is equivalent to EXPENSIVE for now but may become something like "adaptive".
- not sure yet if this should be a "threshold" or not, but the key point is that the main API ought to be very simple to use without having to enter numbers.

nikomatsakis · 2016-10-18T10:11:15Z

closing in favor of #106

nikomatsakis mentioned this pull request Aug 5, 2016

Change default weight of parallel iterators to assume expensive ops #49

Closed

nikomatsakis added 10 commits September 14, 2016 05:12

drive-by cleanup of nbody

fb359c8

deprecate weight and remove some uses of it

007ffb6

introduce a new notion of sequential cutoff

6a40a01

sort submodules (more?) alphabetically

52fac56

introduce new sequential_threshold method

bf1d1cc

port demo to use sequential_threshold method

242a90d

fix sieve demo

2ee62a7

get tests passing

0e64244

adjust benchmarks

aa16e3f

add mergesort to travis configuration

275e1ec

nikomatsakis force-pushed the no-more-weight branch from 2ca44f0 to 275e1ec Compare September 14, 2016 09:23

cuviper mentioned this pull request Oct 2, 2016

parallel map not parallel #101

Closed

nikomatsakis closed this Oct 18, 2016

clamydo mentioned this pull request Jan 11, 2017

deprecate existing weight APIs #111

Closed

nikomatsakis deleted the no-more-weight branch May 23, 2017 09:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove `weight` method and introduce `sequential-threshold` #81

Remove `weight` method and introduce `sequential-threshold` #81

nikomatsakis commented Aug 5, 2016

cuviper commented Aug 14, 2016

nikomatsakis commented Aug 15, 2016

nikomatsakis commented Aug 15, 2016

nikomatsakis commented Aug 15, 2016

iqualfragile commented Sep 14, 2016

nikomatsakis commented Sep 15, 2016

cuviper commented Sep 15, 2016 via email

nikomatsakis commented Oct 14, 2016

nikomatsakis commented Oct 18, 2016

Remove weight method and introduce sequential-threshold #81

Remove weight method and introduce sequential-threshold #81

Conversation

nikomatsakis commented Aug 5, 2016

cuviper commented Aug 14, 2016

nikomatsakis commented Aug 15, 2016

nikomatsakis commented Aug 15, 2016

nikomatsakis commented Aug 15, 2016

iqualfragile commented Sep 14, 2016

nikomatsakis commented Sep 15, 2016

cuviper commented Sep 15, 2016 via email

nikomatsakis commented Oct 14, 2016

nikomatsakis commented Oct 18, 2016

Remove `weight` method and introduce `sequential-threshold` #81

Remove `weight` method and introduce `sequential-threshold` #81