[Security Solution] Kibana OOM and crashing when running indicator match rule #118560

Closed
marshallmain opened this issue Nov 15, 2021 · 1 comment
Assignees: marshallmain
Labels: bug, Team:Detection Alerts, Team:Detections and Resp, Team: SecuritySolution


marshallmain commented Nov 15, 2021

Describe the bug:
Kibana's memory usage grows sharply when running a particular indicator match rule with ~520k indicator items. The rule is scheduled to run every 30 minutes, and on each run Kibana's memory usage climbs to the limit (4GB in this case) and the process crashes. The system returned to normal once the rule was disabled. The logs show the rule executing for ~12 minutes before Kibana crashes, and during that time the rule does not appear to finish executing.

[Screenshot: graph of Kibana memory usage climbing on each rule run]
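
For context, the 4GB ceiling here presumably corresponds to the Node.js heap cap Kibana is started with; a hypothetical example (the exact value in this environment is an assumption):
NODE_OPTIONS="--max-old-space-size=4096" ./bin/kibana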

Kibana/Elasticsearch Stack version:
7.15.1

Steps to reproduce:

  1. Enable APM in Kibana. Ensure that captureSpanStackTraces: true is set in the APM config; in 7.15.1 this happens by default once a serverUrl is set. For example (the server URL below is a placeholder):
// config/apm.dev.js – with a serverUrl set, 7.15.1 enables captureSpanStackTraces by default
module.exports = {
  active: true,
  serverUrl: 'http://localhost:8200', // example APM Server endpoint
};
  2. Create an index with ~500k documents to use as an indicator index. I used the es archive x-pack/test/functional/es_archives/filebeat/default, but modified it by removing the _id from each doc in the archive and creating ~80 copies of the data.json file. Running es_archiver against this folder created an index with ~500k docs (see the example command below).
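For reference, loading an archive with es_archiver looks roughly like the following; the path shown is the unmodified archive (substitute the modified copy), and the URLs/credentials are placeholders:
node scripts/es_archiver load x-pack/test/functional/es_archives/filebeat/default \
  --es-url http://elastic:changeme@localhost:9200 \
  --kibana-url http://elastic:changeme@localhost:5601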
  3. Create an index with a single document for the indicator match rule to query.
PUT /test-index
{
  "mappings": {
    "properties": {
      "@timestamp": {
        "type": "date"
      },
      "host.name": {
        "type": "keyword"
      }
    }
  }
}

POST test-index/_doc
{
  "@timestamp": 1636493980000,
  "host.name": "myHost"
}
  4. Create an indicator match rule that uses the filebeat index as the indicator index and test-index as the source index pattern. Activate the rule. (One way to do this is via the Detection Engine API, sketched below.)
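A sketch of creating such a rule through the Detection Engine rules API; the rule name, queries, and the host.name mapping are illustrative assumptions, and the mapped field must exist in both the source and indicator indices:
curl -s -X POST -u elastic:changeme \
  -H 'kbn-xsrf: true' -H 'Content-Type: application/json' \
  http://localhost:5601/api/detection_engine/rules -d '
{
  "name": "IM OOM repro",
  "description": "Indicator match rule for OOM reproduction",
  "risk_score": 21,
  "severity": "low",
  "type": "threat_match",
  "index": ["test-index"],
  "query": "*:*",
  "threat_index": ["filebeat-*"],
  "threat_query": "*:*",
  "threat_mapping": [
    { "entries": [ { "field": "host.name", "type": "mapping", "value": "host.name" } ] }
  ],
  "interval": "30m",
  "enabled": true
}'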
  5. Observe as Kibana's memory usage increases. After a few minutes, Kibana should crash with an out of memory error. With the filebeat indicator index described above, Kibana crashed after processing ~300k indicator items and using 4GB of memory. (The snippet after this list shows one way to watch memory grow.)
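One way to watch the growth is to poll Kibana's stats API; the URL and credentials are placeholders, and the exact field layout of the response varies slightly across versions:
# Print Kibana's self-reported process memory (heap + RSS) every 30 seconds.
watch -n 30 'curl -s -u elastic:changeme http://localhost:5601/api/stats | jq .process'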

Current behavior:
Kibana crashes due to running out of memory. It appears that the APM agent may be storing the entire response for every Elasticsearch query within a transaction. Since task manager runs each task within a single transaction, every Elasticsearch query response from a rule execution appears to be held in memory for the duration of the run (sketched below).
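
To illustrate the suspected mechanism, here is a sketch of the general elastic-apm-node pattern (not the actual task manager code; runRule is a stand-in for the rule executor):
// Sketch only – not the real task manager implementation.
const apm = require('elastic-apm-node');

async function runRuleTask(runRule) {
  // Task manager wraps the whole task run in a single APM transaction...
  const transaction = apm.startTransaction('run-indicator-match-rule', 'task');
  try {
    // ...and the agent records a span for every Elasticsearch query made
    // while the rule runs. With ~520k indicator items the rule issues many
    // thousands of searches, so per-span data accumulates for the lifetime
    // of the transaction – which here is the full >12 minute execution.
    await runRule();
  } finally {
    transaction.end();
  }
}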

Without APM enabled, the same rule executes without crashing Kibana.
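
Since the report calls out captureSpanStackTraces: true as the setting that gets enabled automatically, one possible mitigation while keeping APM on would be to turn it off explicitly. This is a standard elastic-apm-node option, but whether it fully avoids the OOM here is untested:
// config/apm.dev.js – untested mitigation sketch, not a confirmed fix
module.exports = {
  active: true,
  serverUrl: 'http://localhost:8200', // example APM Server endpoint
  captureSpanStackTraces: false,      // skip capturing a stack trace for every span
};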

Expected behavior:
Kibana should not run out of memory, even with APM enabled.

marshallmain added the bug, Team:Detection Alerts, Team:Detections and Resp, and Team: SecuritySolution labels on Nov 15, 2021
marshallmain self-assigned this on Nov 15, 2021

marshallmain commented

Closing as this is not a bug in the indicator match rule logic.
