[APM] Metrics powered UI POC #66871

dgieselaar · 2020-05-18T08:08:27Z

This is a POC to demonstrate some of the work we need to do to migrate to a metrics powered UI. The approach is:

- In setup_request, determine whether the cluster has one or more transaction duration metricsets for the given time range.
- If so, query the metrics index and aggregate on transaction.duration.histogram rather than transaction.duration.us.
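
A minimal sketch of that decision, for illustration only. The interface and function names (`ApmIndicesConfig`, `DurationSource`, `getDurationSource`) are hypothetical; only the two duration field names and the `apm_oss.*` setting keys come from this PR.

```typescript
// Hypothetical types and names; not the code in this PR.
interface ApmIndicesConfig {
  'apm_oss.metricsIndices': string;
  'apm_oss.transactionIndices': string;
}

interface DurationSource {
  index: string;
  durationField: 'transaction.duration.histogram' | 'transaction.duration.us';
}

// Pick the index and duration field once, based on the result of the
// "are there transaction duration metricsets in this time range?" check.
function getDurationSource(
  indices: ApmIndicesConfig,
  hasTransactionDurationMetrics: boolean
): DurationSource {
  if (hasTransactionDurationMetrics) {
    // Pre-aggregated histogram docs live in the metrics index.
    return {
      index: indices['apm_oss.metricsIndices'],
      durationField: 'transaction.duration.histogram',
    };
  }
  // Fall back to raw transaction docs.
  return {
    index: indices['apm_oss.transactionIndices'],
    durationField: 'transaction.duration.us',
  };
}
```

Keeping the choice in one place means each query only has to ask for "the duration source" and never branches on the flag itself.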

I've opted for this approach because there are several scenarios to consider, and this is the simplest decision tree I can think of. For instance, the user could have transaction duration metrics, but could be looking at an earlier time range, or they could be using the query bar and filtering on non-aggregated fields.

I've also split up the services aggregations into smaller requests, which is recommended to make better use of parallelism.

I haven't done any performance comparisons yet.

To try this out, you'll need to enable aggregation in APM Server: -E apm-server.aggregation.enabled=true. See elastic/apm-server#3651.

Before:

After:

(Some data is off due to (some of the) agents not running for the full 24 hours).

elasticmachine · 2020-05-18T08:08:31Z

Pinging @elastic/apm-ui (Team:apm)

sorenlouv · 2020-05-20T11:42:11Z

x-pack/plugins/apm/common/projections/services.ts

+
+  const index = [
+    indices['apm_oss.metricsIndices'],
+    indices['apm_oss.errorIndices']


Are there any plans to pre-aggregate errors in the future?

sorenlouv · 2020-05-20T11:45:39Z

x-pack/plugins/apm/common/projections/services.ts

+
+  if (!hasTransactionDurationMetrics) {
+    index.push(indices['apm_oss.transactionIndices']);
+  }


Instead of having this in every query, what about letting the APM Elasticsearch client handle it? The consumer side specifies the indices that should be queried as strings (TypeScript can enforce correctness):

{ apmIndices: ['transaction', 'error'], body: {...} }

Then the client will determine which indices should be queried according to hasTransactionDurationMetrics:

// named `index` so it does not shadow the `indices` config object
const index = apmIndices.map((apmIndex) => {
  switch (apmIndex) {
    case 'span':
      return indices['apm_oss.spanIndices'];
    case 'error':
      return indices['apm_oss.errorIndices'];
    case 'metric':
      return indices['apm_oss.metricsIndices'];
    case 'transaction':
      return hasTransactionDurationMetrics
        ? indices['apm_oss.metricsIndices']
        : indices['apm_oss.transactionIndices'];
  }
});

We need this client anyway to make it easier for teams outside APM to query APM data (SIEM, for instance).
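
A typed sketch of the mapping proposed above, assuming a union type for the logical index names so that TypeScript rejects unknown strings. `ApmIndexName`, `ApmIndicesConfig` and `resolveIndices` are hypothetical names, not an existing Kibana API; the de-duplication step is an addition that the comment does not ask for.

```typescript
// Hypothetical names throughout; only the apm_oss.* setting keys are from the PR.
type ApmIndexName = 'span' | 'error' | 'metric' | 'transaction';

interface ApmIndicesConfig {
  'apm_oss.spanIndices': string;
  'apm_oss.errorIndices': string;
  'apm_oss.metricsIndices': string;
  'apm_oss.transactionIndices': string;
}

function resolveIndex(
  name: ApmIndexName,
  config: ApmIndicesConfig,
  hasTransactionDurationMetrics: boolean
): string {
  switch (name) {
    case 'span':
      return config['apm_oss.spanIndices'];
    case 'error':
      return config['apm_oss.errorIndices'];
    case 'metric':
      return config['apm_oss.metricsIndices'];
    case 'transaction':
      // Pre-aggregated transactions are read from the metrics index.
      return hasTransactionDurationMetrics
        ? config['apm_oss.metricsIndices']
        : config['apm_oss.transactionIndices'];
  }
}

function resolveIndices(
  names: ApmIndexName[],
  config: ApmIndicesConfig,
  hasTransactionDurationMetrics: boolean
): string[] {
  const resolved = names.map((name) =>
    resolveIndex(name, config, hasTransactionDurationMetrics)
  );
  // 'metric' and 'transaction' can resolve to the same index; keep one copy.
  return resolved.filter((index, i) => resolved.indexOf(index) === i);
}
```

With this shape, a consumer passing `['transactions']` (plural) would fail type checking rather than silently querying nothing.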

sorenlouv · 2020-05-20T11:54:23Z

x-pack/plugins/apm/common/projections/services.ts

-          filter: [
-            {
-              terms: { [PROCESSOR_EVENT]: ['transaction', 'error', 'metric'] }
-            },


Removing the processor.event filter could cause problems because by default we query apm-* (all indices), so this query will return transactions, metrics, errors and spans.
I think you'll need the same logic as for the index.

I think the queried indices and processor.event should always be aligned. I was hoping we could add the processor.event filter in the setup (middleware), but an explicit helper might be better. Something like:

const { index, processorEvent } = hasTransactionDurationMetrics
  ? // transactions are stored as pre-aggregated (histogram) docs in the metric index
    getIndexAndProcessorEvent('metric', 'error')
  : // transactions are not pre-aggregated and are stored in the transaction index
    getIndexAndProcessorEvent('metric', 'error', 'transaction');
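
One possible shape for such a helper, as a sketch only. `getIndexAndProcessorEvent` is the name suggested above, but its signature here is an assumption: this version takes a hypothetical `indexByEvent` lookup as its first argument so that the example is self-contained.

```typescript
// Hypothetical sketch; the real helper (if added) may look different.
type ProcessorEvent = 'transaction' | 'error' | 'metric' | 'span';

// Assumed lookup from processor.event value to the index pattern that stores it.
type IndexByEvent = Record<ProcessorEvent, string>;

function getIndexAndProcessorEvent(
  indexByEvent: IndexByEvent,
  ...events: ProcessorEvent[]
): { index: string[]; processorEvent: ProcessorEvent[] } {
  return {
    // The indices and the processor.event values are derived from the same
    // list, so they cannot drift apart.
    index: events.map((event) => indexByEvent[event]),
    processorEvent: events,
  };
}

// Build the terms filter that goes with the resolved indices.
function getProcessorEventFilter(processorEvent: ProcessorEvent[]) {
  return { terms: { 'processor.event': processorEvent } };
}
```

Because both values come out of one call, a query that targets the metrics and error indices is always filtered on exactly `metric` and `error`, even when the index setting is the catch-all apm-*.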

dgieselaar · 2020-06-03T08:34:19Z

Based on some data analysis, we are likely not going to aggregate on user_agent.original, as it greatly increases the number of metric documents that are stored. That means we cannot migrate some of the RUM charts for now, and users who have RUM services and want to use those charts will have to keep storing unsampled transactions until we find a fix for this issue. See elastic/apm-server#3841.

sorenlouv · 2020-05-20T12:17:47Z

x-pack/plugins/apm/server/lib/helpers/setup_request.ts

+  const start =
+    'start' in query ? { start: moment.utc(query.start).valueOf() } : {};
+  const end = 'end' in query ? { end: moment.utc(query.end).valueOf() } : {};


What about:

Suggested change

-  const start =
-    'start' in query ? { start: moment.utc(query.start).valueOf() } : {};
-  const end = 'end' in query ? { end: moment.utc(query.end).valueOf() } : {};
+  const start = 'start' in query ? moment.utc(query.start).valueOf() : undefined;
+  const end = 'end' in query ? moment.utc(query.end).valueOf() : undefined;

Then you don't have to write start.start and end.end. I think start and end should always be returned, even when they are undefined.
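
A small sketch of the "always return both keys" shape. To stay dependency-free it uses `Date.parse` as a stand-in for `moment.utc(...).valueOf()`; that substitution is an assumption and only matches for ISO 8601 strings that carry an explicit `Z` or offset. `TimeRangeQuery`, `TimeRange` and `getTimeRange` are hypothetical names.

```typescript
// Hypothetical names; Date.parse stands in for moment.utc(...).valueOf().
interface TimeRangeQuery {
  start?: string;
  end?: string;
}

interface TimeRange {
  start: number | undefined;
  end: number | undefined;
}

function getTimeRange(query: TimeRangeQuery): TimeRange {
  // Both keys are always present; callers check for undefined instead of
  // checking whether the key exists.
  return {
    start: query.start !== undefined ? Date.parse(query.start) : undefined,
    end: query.end !== undefined ? Date.parse(query.end) : undefined,
  };
}
```

Callers can then destructure `const { start, end } = getTimeRange(query)` unconditionally and compare against `undefined`, which also avoids treating a legitimate timestamp of 0 as missing.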

sorenlouv · 2020-05-20T12:20:31Z

x-pack/plugins/apm/server/lib/helpers/setup_request.ts

+  const checkTransactionDurationMetrics = async () => {
+    const response = await client.search({
+      index: indices['apm_oss.metricsIndices'],
+      body: {
+        query: {
+          bool: {
+            filter: [
+              { term: { [PROCESSOR_EVENT]: 'metric' } },
+              ...(start.start && end.end
+                ? [{ range: rangeFilter(start.start, end.end) }]
+                : []),
+              { exists: { field: TRANSACTION_DURATION_HISTOGRAM } }
+            ]
+          }
+        },
+        size: 0
+      },
+      terminateAfter: 1
+    });
+
+    return {
+      hasTransactionDurationMetrics: response.hits.total.value > 0
+    };
+  };


Perhaps extract this to a separate file to keep setup_request from becoming overwhelming.
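
As a sketch of what an extracted module could contain, the request construction and the response check can be split into two pure functions that setup_request then wires to the client. The function names and the `rangeFilter` stand-in (including its `@timestamp` field and `epoch_millis` format) are assumptions; the query shape mirrors the diff above.

```typescript
// Hypothetical module contents; field values follow the discussion in this PR.
const PROCESSOR_EVENT = 'processor.event';
const TRANSACTION_DURATION_HISTOGRAM = 'transaction.duration.histogram';

// Stand-in for the existing rangeFilter helper (shape assumed).
function rangeFilter(start: number, end: number) {
  return { '@timestamp': { gte: start, lte: end, format: 'epoch_millis' } };
}

interface MetricsCheckParams {
  metricsIndex: string;
  start?: number;
  end?: number;
}

// Builds the search parameters without touching the client.
function getTransactionDurationMetricsRequest(params: MetricsCheckParams) {
  const filter: object[] = [
    { term: { [PROCESSOR_EVENT]: 'metric' } },
    { exists: { field: TRANSACTION_DURATION_HISTOGRAM } },
  ];
  // Compare against undefined so that a timestamp of 0 still counts.
  if (params.start !== undefined && params.end !== undefined) {
    filter.push({ range: rangeFilter(params.start, params.end) });
  }
  return {
    index: params.metricsIndex,
    // One matching document is enough to answer the question.
    terminateAfter: 1,
    body: { size: 0, query: { bool: { filter } } },
  };
}

// Interprets the search response.
function hasTransactionDurationMetricsInResponse(response: {
  hits: { total: { value: number } };
}): boolean {
  return response.hits.total.value > 0;
}
```

Keeping both functions free of I/O makes them straightforward to unit test, and setup_request only needs a single `client.search(getTransactionDurationMetricsRequest(...))` call.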

sorenlouv · 2020-06-03T11:17:56Z

x-pack/plugins/apm/common/projections/services.ts

+    end,
+    uiFiltersES,
+    indices,
+    hasTransactionDurationMetrics


Nit about the naming: is it important to note that these are TransactionDuration docs, and not just Transaction docs?

What about hasPreAggregatedTransactions, or isPreAggregationEnabled (in case we want to pre-aggregate other data types in the future, e.g. errors)?

kibanamachine · 2020-06-08T09:00:45Z

💔 Build Failed

continuous-integration/kibana-ci/pull-request
Commit: f2714bf
Pipeline Steps (look for red circles / failed steps)
Interpreting CI Failures

Failed CI Steps

Test Failures

Kibana Pipeline / x-pack-intake-agent / X-Pack Jest Tests.x-pack/plugins/apm/common.Transaction TRANSACTION_DURATION_HISTOGRAM

Link to Jenkins

Standard Out

Failed Tests Reporter:
  - Test has not failed recently on tracked branches

Stack Trace

Error: expect(received).toMatchSnapshot()

New snapshot was not written. The update flag must be explicitly passed to write a new snapshot.

This is likely because this test is run in a continuous integration (CI) environment in which snapshots are not written by default.

Received value undefined
    at Object.it (/dev/shm/workspace/kibana/x-pack/plugins/apm/common/elasticsearch_fieldnames.test.ts:176:21)
    at Object.asyncJestTest (/dev/shm/workspace/kibana/node_modules/jest-jasmine2/build/jasmineAsyncInstall.js:102:37)
    at resolve (/dev/shm/workspace/kibana/node_modules/jest-jasmine2/build/queueRunner.js:43:12)
    at new Promise (<anonymous>)
    at mapper (/dev/shm/workspace/kibana/node_modules/jest-jasmine2/build/queueRunner.js:26:19)
    at promise.then (/dev/shm/workspace/kibana/node_modules/jest-jasmine2/build/queueRunner.js:73:41)
    at process._tickCallback (internal/process/next_tick.js:68:7)

Kibana Pipeline / x-pack-intake-agent / X-Pack Jest Tests.x-pack/plugins/apm/common.Span TRANSACTION_DURATION_HISTOGRAM

Link to Jenkins

Standard Out

Failed Tests Reporter:
  - Test has not failed recently on tracked branches

Stack Trace

Error: expect(received).toMatchSnapshot()

New snapshot was not written. The update flag must be explicitly passed to write a new snapshot.

This is likely because this test is run in a continuous integration (CI) environment in which snapshots are not written by default.

Received value undefined
    at Object.it (/dev/shm/workspace/kibana/x-pack/plugins/apm/common/elasticsearch_fieldnames.test.ts:176:21)
    at Object.asyncJestTest (/dev/shm/workspace/kibana/node_modules/jest-jasmine2/build/jasmineAsyncInstall.js:102:37)
    at resolve (/dev/shm/workspace/kibana/node_modules/jest-jasmine2/build/queueRunner.js:43:12)
    at new Promise (<anonymous>)
    at mapper (/dev/shm/workspace/kibana/node_modules/jest-jasmine2/build/queueRunner.js:26:19)
    at promise.then (/dev/shm/workspace/kibana/node_modules/jest-jasmine2/build/queueRunner.js:73:41)
    at process._tickCallback (internal/process/next_tick.js:68:7)

Kibana Pipeline / x-pack-intake-agent / X-Pack Jest Tests.x-pack/plugins/apm/common.Error TRANSACTION_DURATION_HISTOGRAM

Link to Jenkins

Standard Out

Failed Tests Reporter:
  - Test has not failed recently on tracked branches

Stack Trace

Error: expect(received).toMatchSnapshot()

New snapshot was not written. The update flag must be explicitly passed to write a new snapshot.

This is likely because this test is run in a continuous integration (CI) environment in which snapshots are not written by default.

Received value undefined
    at Object.it (/dev/shm/workspace/kibana/x-pack/plugins/apm/common/elasticsearch_fieldnames.test.ts:176:21)
    at Object.asyncJestTest (/dev/shm/workspace/kibana/node_modules/jest-jasmine2/build/jasmineAsyncInstall.js:102:37)
    at resolve (/dev/shm/workspace/kibana/node_modules/jest-jasmine2/build/queueRunner.js:43:12)
    at new Promise (<anonymous>)
    at mapper (/dev/shm/workspace/kibana/node_modules/jest-jasmine2/build/queueRunner.js:26:19)
    at promise.then (/dev/shm/workspace/kibana/node_modules/jest-jasmine2/build/queueRunner.js:73:41)
    at process._tickCallback (internal/process/next_tick.js:68:7)

and 2 more failures, only showing the first 3.

History

💔 Build #48577 failed cb45b3a

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

dgieselaar · 2020-09-17T07:12:49Z

Superseded by #73953

[APM] Metrics powered UI POC

cb45b3a

botelastic bot added the Team:APM All issues that need APM UI Team support label May 18, 2020

sorenlouv linked an issue May 20, 2020 that may be closed by this pull request

[APM] Support latency metrics #62459

Closed

sorenlouv reviewed May 20, 2020

sorenlouv added release_note:enhancement v7.9.0 labels Jun 3, 2020

sorenlouv reviewed Jun 3, 2020

dgieselaar added 3 commits June 4, 2020 22:49

Merge branch 'master' of github.com:elastic/kibana into metrics-ui-poc

10cc497

Use max size

0ebef5d

Merge branch 'master' of github.com:elastic/kibana into metrics-ui-poc

f2714bf

axw mentioned this pull request Jul 29, 2020

Document transaction metrics elastic/apm-server#4031

Closed

spalger added v7.9.2 and removed v7.9.0 labels Sep 3, 2020

dgieselaar closed this Sep 17, 2020
