Improve buffer settings of ES Loaders to allow for higher throughput #378

jbeemster · 2023-03-21T04:13:14Z

With v0.16.0 a lot of the memory settings were tuned which yielded fairly good results:

Large: 100-200 RPS (even split between good and bad)
XLarge: ~800-900 RPS
XXLarge: ~3000 RPS

However the common factor (outside of being CPU bound for throughput) was that the ES Loader could not keep up with the load. Digging into it this does not seem to stem from ES Cluster as an upstream bottleneck but rather that we are inserting 1 event at a time into Elasticsearch.

https://github.com/snowplow/snowplow-mini/blob/master/provisioning/resources/configs/snowplow-es-loader-good.hocon#L38-L43
https://github.com/snowplow/snowplow-mini/blob/master/provisioning/resources/configs/snowplow-es-loader-bad.hocon#L38-L43

This was presumably done so that events land in ES as soon as they are sent but it means that we have fairly terrible throughput at the higher end.

jbeemster · 2023-03-21T19:16:55Z

cc/ @istreeter as you had some ideas in this area for a future version.

…lose #378)

istreeter mentioned this issue Mar 21, 2023

NSQ executor should periodically flush buffer snowplow/snowplow-elasticsearch-loader#256

Closed

istreeter added a commit that referenced this issue Mar 21, 2023

Improve buffer settings of ES Loaders to allow for higher throughput (c…

154cd33

…lose #378)

istreeter added a commit that referenced this issue Mar 21, 2023

Improve buffer settings of ES Loaders to allow for higher throughput (c…

53c6035

…lose #378)

istreeter added a commit that referenced this issue Mar 21, 2023

Improve buffer settings of ES Loaders to allow for higher throughput (c…

9121e5e

…lose #378)

istreeter mentioned this issue Mar 22, 2023

NSQ executor should periodically flush buffer snowplow/snowplow-elasticsearch-loader#254

Closed

istreeter closed this as completed in b365a7e Mar 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve buffer settings of ES Loaders to allow for higher throughput #378

Improve buffer settings of ES Loaders to allow for higher throughput #378

jbeemster commented Mar 21, 2023

jbeemster commented Mar 21, 2023

Improve buffer settings of ES Loaders to allow for higher throughput #378

Improve buffer settings of ES Loaders to allow for higher throughput #378

Comments

jbeemster commented Mar 21, 2023

jbeemster commented Mar 21, 2023