Very slow startup: can this be mitigated? #211

rschwiebert · 2024-10-10T01:34:20Z

After observing the first computation done by multik's arrays seemed slow, I was surprised that the following toy script

import org.jetbrains.kotlinx.multik.api.linalg.dot
import org.jetbrains.kotlinx.multik.api.mk
import org.jetbrains.kotlinx.multik.api.ndarray
import kotlin.random.Random


fun main() {
    repeat(10) {
        val a = mk.ndarray(listOf(listOf(Random.nextDouble())))
        val b = mk.ndarray(listOf(listOf(Random.nextDouble())))
        val start = System.nanoTime()
        a dot b
        println(System.nanoTime() - start)
    }
}

reliably resulted in results on this order when run in intellij:

12440996083
58417
18541
14458
15042
16833
14041
15042
36708
67042

Before I knew the magnitude of the difference between the first item and the rest I was writing it off as some sort of cacheing effect. But now it really seems like it's some sort of lazy initialization or JIT compiling or something that has nothing to do with the computation at hand.

Can this be mitigated? I tried with both 0.2.2 and 0.2.3 with the same results. My laptop is an M1 chip mac, but I'm ultimately interested in running it with ubuntu distros.

Thanks in advance for your advice.

New info:
Running the same test on an ubuntu instance, it does seem like the delay is magnitudes smaller there (only 0.2s). Maybe this is something to do with my laptop architecture or something.

The text was updated successfully, but these errors were encountered:

devcrocod · 2024-10-10T17:21:16Z

Hi, thank you report. It's really interesting stuff

Your assessment is correct: the first execution is indeed longer than later ones. The numbers and the difference may not be entirely accurate, as many factors can influence this.

There are mainly two reasons why the first computation is slower. And yeah, you're right, it's a lazy initialization in the following way:

The implementation engine (kotlin or native) is located. This takes little computational time, but there are some overheads associated with it.
In the case of Kotlin, the computation proceeds immediately afterward. But if you use native, the native library is first loaded (System.load), which takes the majority of the time during this call. After that, a jni call occurs, which incurs slightly more overhead for the first invocation.

It is not possible to eliminate or speed up the native library loading process. This behavior is inherent to the JVM itself. You can use a pure kotlin implementation — multik-kotlin, without a native library. However, the computations will be slower in this case.

You can also pre-load the native library in advance by calling mk.math or mk.linalg before your main computations.

If it is critical for you to measure performance, including the first computation, I recommend using jmh or kotlin-benchmarks. This will more accurately measure the specific computation and reduce the impact of external factors, such as other processes

PS: Kotlin has a convenient function measureTime - https://kotlinlang.org/docs/time-measurement.html#measure-time

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Very slow startup: can this be mitigated? #211

Very slow startup: can this be mitigated? #211

rschwiebert commented Oct 10, 2024 •

edited

Loading

devcrocod commented Oct 10, 2024

Very slow startup: can this be mitigated? #211

Very slow startup: can this be mitigated? #211

Comments

rschwiebert commented Oct 10, 2024 • edited Loading

devcrocod commented Oct 10, 2024

rschwiebert commented Oct 10, 2024 •

edited

Loading