perf: optimize label sanitization #95

kruskall · 2023-06-30T13:01:32Z

The following code was used to microbenchmark the sanitizeLabel function:

package modelpb

import "testing"

func BenchmarkSanitize(b *testing.B) {
        small := "foobar"
        l := "foobarfoobarfoobarfoobarfoobarfoobarfoobar"
        b.Run("small", func(b *testing.B) {
                for i := 0; i < b.N; i++ {
                        sanitizeLabelKey(small)
                }
        })
        b.Run("l", func(b *testing.B) {
                for i := 0; i < b.N; i++ {
                        sanitizeLabelKey(l)
                }
        })
}

Benchmarks show a performance improvement for both small and large label keys:

                  │  before.txt  │              after.txt              │
                  │    sec/op    │   sec/op     vs base                │
Sanitize/small-20   16.100n ± 1%   9.191n ± 1%  -42.91% (p=0.000 n=10)
Sanitize/l-20        81.55n ± 1%   29.35n ± 2%  -64.01% (p=0.000 n=10)
geomean              36.23n        16.42n       -54.67%

                  │  before.txt  │              after.txt              │
                  │     B/op     │    B/op     vs base                 │
Sanitize/small-20   0.000 ± 0%     0.000 ± 0%       ~ (p=1.000 n=10) ¹
Sanitize/l-20       0.000 ± 0%     0.000 ± 0%       ~ (p=1.000 n=10) ¹
geomean                        ²               +0.00%                ²
¹ all samples are equal
² summaries must be >0 to compute geomean

                  │  before.txt  │              after.txt              │
                  │  allocs/op   │ allocs/op   vs base                 │
Sanitize/small-20   0.000 ± 0%     0.000 ± 0%       ~ (p=1.000 n=10) ¹
Sanitize/l-20       0.000 ± 0%     0.000 ± 0%       ~ (p=1.000 n=10) ¹

Closes #49

Benchmarks show a performance improvement for both small and large label keys: │ before.txt │ after.txt │ │ sec/op │ sec/op vs base │ Sanitize/small-20 16.100n ± 1% 9.191n ± 1% -42.91% (p=0.000 n=10) Sanitize/l-20 81.55n ± 1% 29.35n ± 2% -64.01% (p=0.000 n=10) geomean 36.23n 16.42n -54.67% │ before.txt │ after.txt │ │ B/op │ B/op vs base │ Sanitize/small-20 0.000 ± 0% 0.000 ± 0% ~ (p=1.000 n=10) ¹ Sanitize/l-20 0.000 ± 0% 0.000 ± 0% ~ (p=1.000 n=10) ¹ geomean ² +0.00% ² ¹ all samples are equal ² summaries must be >0 to compute geomean │ before.txt │ after.txt │ │ allocs/op │ allocs/op vs base │ Sanitize/small-20 0.000 ± 0% 0.000 ± 0% ~ (p=1.000 n=10) ¹ Sanitize/l-20 0.000 ± 0% 0.000 ± 0% ~ (p=1.000 n=10) ¹

kruskall added 2 commits June 30, 2023 14:51

perf: optimise for sanitised labels by only updating the maps if needed

50ff0db

kruskall changed the title ~~Perf/optimize labels~~ perf: optimize label sanitisation Jun 30, 2023

kruskall changed the title ~~perf: optimize label sanitisation~~ perf: optimize label sanitization Jun 30, 2023

kruskall requested a review from a team June 30, 2023 13:08

carsonip approved these changes Jun 30, 2023

View reviewed changes

kruskall merged commit 60568a2 into elastic:main Jun 30, 2023

kruskall deleted the perf/optimize-labels branch June 30, 2023 13:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: optimize label sanitization #95

perf: optimize label sanitization #95

kruskall commented Jun 30, 2023

perf: optimize label sanitization #95

perf: optimize label sanitization #95

Conversation

kruskall commented Jun 30, 2023