
No optimization for Scala parallelized collections #25

Open
ochafik opened this issue Mar 16, 2015 · 4 comments
ochafik commented Mar 16, 2015

From @fdietze on November 10, 2011 16:21

It seems like there is no optimization for the parallelized collections in Scala.

This is optimized:
(0 until 1000).map

While this is not:
(0 until 1000).par.map
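
For illustration, here is a hand-written sketch of the kind of while-loop rewrite the plugin applies in the sequential Range case above (an approximation of the idea, not the plugin's actual output):

```scala
object RangeRewriteSketch {
  // Equivalent of (0 until n).map(i => i * i), expressed as the kind of
  // while loop the plugin can generate: it fills an array directly,
  // avoiding closure calls and Range iteration overhead.
  def squares(n: Int): Array[Int] = {
    val out = new Array[Int](n)
    var i = 0
    while (i < n) {
      out(i) = i * i // inlined body of the mapped function
      i += 1
    }
    out
  }

  def main(args: Array[String]): Unit =
    println(squares(1000)(10)) // prints 100
}
```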

What's the easiest way to get the parallelized collections optimized? The CL-Collections?

Thanks for this great compiler plugin. It helped me a lot in speeding up my existing project.

Copied from original issue: nativelibs4java/nativelibs4java#199


ochafik commented Mar 16, 2015

Hi fdietze,

Thanks for your feedback!

The plugin optimizes code that leaves room for optimization. This is the case for Range, where a rewrite into while loops can speed things up a lot. With parallel collections, though, it is not clear how to make the code run faster, since rewriting the calls into while loops is no longer an (easy) option.
ScalaCL collections can indeed provide some acceleration, but with some trade-offs: fewer operations are supported in an efficient way, and data copies to and from the collections can be very costly, so they should be done with care.

What kind of optimization do you have in mind ?

Cheers


ochafik commented Mar 16, 2015

From @fdietze on November 11, 2011 0:06

Hi ochafik,

thanks for your answer.

I'm thinking about something similar to what OpenMP does. Because we have loops with a fixed number of iterations, we can split the range into chunks of size (iterations / #cpus) and run them independently, with different threads and while loops. But I don't know if that's as trivial as the other transformations are...
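
To make the idea concrete, here is a sketch of that strategy in plain Scala: the range is cut into one chunk per core, and each chunk runs as an independent while loop on its own thread (illustrative code only, not something the plugin generates):

```scala
object ChunkedParallelSketch {
  // OpenMP-style parallel equivalent of (0 until n).map(i => i * i):
  // split the fixed iteration range into chunks of roughly n / #cpus
  // iterations and run each chunk as a plain while loop on a thread.
  def squares(n: Int): Array[Int] = {
    val out = new Array[Int](n)
    val cpus = Runtime.getRuntime.availableProcessors()
    val chunk = (n + cpus - 1) / cpus // ceiling division
    val threads = (0 until cpus).map { t =>
      val start = t * chunk
      val end = math.min(start + chunk, n)
      new Thread(new Runnable {
        def run(): Unit = {
          var i = start
          while (i < end) { // each chunk is an independent while loop
            out(i) = i * i
            i += 1
          }
        }
      })
    }
    threads.foreach(_.start())
    threads.foreach(_.join()) // wait for all chunks to finish
    out
  }

  def main(args: Array[String]): Unit =
    println(squares(1000)(999)) // prints 998001
}
```

Chunks write to disjoint slices of the output array, so no synchronization beyond the final join is needed.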


ochafik commented Mar 16, 2015

Hi fdietze,

This does indeed seem far from trivial, especially without hints to the compiler (and my guess is that the overall gain, if any, would not justify the work).
I'm afraid I don't have the resources to explore this path at the moment, but feel free to explore it and post suggestions / status reports in this issue.

Cheers


ochafik commented Mar 16, 2015

For the record, here's a document that explains how OpenMP parallel loops work and what they look like:
http://bisqwit.iki.fi/story/howto/openmp/#LoopDirectiveFor
