
feat: memory profiling #524

Merged
merged 24 commits into from
Sep 16, 2022
Conversation

@seemk (Collaborator) commented Aug 7, 2022

tldr

  • Use v8::HeapProfiler to periodically capture an allocation profile along with the samples.
  • The allocation samples returned from V8 are "live memory", so we have to keep track of which samples were added.
  • Each sample has a node_id, which identifies its node in the call graph. To generate a stack trace, this node needs to be found in the allocation profile.
  • To make this lookup possible, the whole allocation profile graph is converted to a more convenient format: given a node_id, we can do a dictionary lookup and reconstruct the path to a root call graph node, which maps nicely onto the pprof location array. The conversion requires generating the whole graph. There is room for improvement here (e.g. caching the node graph and regenerating it only when an unknown node_id shows up in the samples), but at the moment it seems to be fast enough.
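The lookup described above can be sketched roughly like this (plain Node.js; `buildNodeIndex`, `stackForNode`, and the node shape are illustrative names, not the PR's actual code):

```javascript
// Hypothetical node shape: V8's AllocationProfile exposes a call tree
// whose nodes carry an id, call frame info, and children.
// Flatten it into a map of node id -> { name, parentId } so that a
// sample's node_id resolves to a full stack via parent links.
function buildNodeIndex(root) {
  const index = new Map();
  const stack = [{ node: root, parentId: null }];
  while (stack.length > 0) {
    const { node, parentId } = stack.pop();
    index.set(node.id, { name: node.name, parentId });
    for (const child of node.children) {
      stack.push({ node: child, parentId: node.id });
    }
  }
  return index;
}

// Reconstruct the path from a sampled node back to the root; the
// resulting leaf-first frame list maps onto a pprof location array.
function stackForNode(index, nodeId) {
  const frames = [];
  let entry = index.get(nodeId);
  while (entry) {
    frames.push(entry.name);
    entry = entry.parentId === null ? undefined : index.get(entry.parentId);
  }
  return frames;
}
```

This is also where the caching idea from the last bullet would slot in: keep the index between capture cycles and rebuild it only when a sampled node_id misses the map.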

⚠️ Trace and span ID correlation can't be done because V8 does not provide any timing information for the samples.

Misc

  • Removed obsolete Node.js versions (<12) from prebuild:os script

@seemk seemk requested review from a team as code owners August 7, 2022 12:39
@seemk seemk requested a review from a team as a code owner August 7, 2022 12:42

codecov-commenter commented Aug 7, 2022

Codecov Report

Merging #524 (d9a517a) into main (96a308a) will decrease coverage by 1.70%.
The diff coverage is 75.00%.

@@            Coverage Diff             @@
##             main     #524      +/-   ##
==========================================
- Coverage   88.94%   87.24%   -1.71%     
==========================================
  Files          27       27              
  Lines         914      972      +58     
  Branches      204      210       +6     
==========================================
+ Hits          813      848      +35     
- Misses        101      124      +23     
Impacted Files Coverage Δ
src/profiling/DebugExporter.ts 18.75% <0.00%> (-11.25%) ⬇️
src/profiling/types.ts 100.00% <ø> (ø)
src/profiling/OTLPProfilingExporter.ts 36.06% <10.00%> (-12.78%) ⬇️
src/profiling/index.ts 90.00% <100.00%> (+2.34%) ⬆️
src/profiling/utils.ts 90.81% <100.00%> (+1.65%) ⬆️


Base automatically changed from pprof-export to main August 7, 2022 14:11

### Memory profiling

Memory profiling is disabled by default, it can be enabled via the `memoryProfilingEnabled` flag.


Suggestion: Memory profiling is disabled by default. You can enable it via the memoryProfilingEnabled flag.

Collaborator Author

Thanks! Perhaps you already know, but GitHub lets you add suggestions directly to the comment via a suggest code block, so the author can press a button to bring the change in, for example:

Suggested change
Memory profiling is disabled by default, it can be enabled via the `memoryProfilingEnabled` flag.
Memory profiling is disabled by default. You can enable it via the `memoryProfilingEnabled` flag.


Internally, the profiler uses V8's sampling heap profiler and periodically queries the allocation profile for new samples.

The [V8 heap profiler's parameters](https://v8.github.io/api/head/classv8_1_1HeapProfiler.html#a6b9450bbf1f4e1a4909df92d4df4a174) can additionally be tuned by an optional `memoryProfilingOptions` configuration field:


Suggestion: You can tune V8 heap profiler's parameters using the memoryProfilingOptions configuration field:
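For illustration only, a sketch of what such a configuration might look like; the field names `maxStackDepth` and `sampleIntervalBytes` are assumptions mirroring V8's `StartSamplingHeapProfiler(sample_interval, stack_depth)` parameters, not taken verbatim from this PR:

```javascript
// Hypothetical configuration shape (an assumption based on the flag
// names mentioned in this PR and V8's sampling heap profiler knobs).
const profilingConfig = {
  memoryProfilingEnabled: true,
  memoryProfilingOptions: {
    maxStackDepth: 128,             // stack depth captured per allocation sample
    sampleIntervalBytes: 64 * 1024, // average allocated bytes between samples
  },
};
```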

@rauno56 (Contributor) left a comment

Mainly curious questions.

test/profiling/extension.test.ts (resolved)

auto jsResult = Nan::New<v8::Object>();
auto jsSamples = Nan::New<v8::Array>();
auto jsNodeTree = Nan::New<v8::Object>();
Contributor

Are the node IDs unique across the lifetime of the program? I ask to suss out why an object is used instead of an array, given that the keys are integers already: it saves a conversion and perhaps some (type) errors (even though arr[2] === arr['2']).

Collaborator Author

Node ID is an incrementing counter; no idea how large it gets or how many gaps there will be. I just used the denser form 🤷‍♂️

Contributor

const arr = [];
arr[5] = "hello";
arr[10] = "world";

Makes a "holey" array, which is equivalent in terms of density. I assume calling the same API from the native side behaves the same.

Collaborator Author

Tested it, and the array actually transitions to the DICTIONARY_ELEMENTS kind even with a small app, since the node IDs get large; so it's basically the same as using an object 🤔 And for some reason the average read speed was actually faster with an object (no idea why), while the write speeds stayed the same.
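The sparse-keys behaviour under discussion can be reproduced in plain Node.js (the node IDs below are made up; observing the actual DICTIONARY_ELEMENTS transition needs V8 internals, e.g. running with `--allow-natives-syntax`, which is out of scope here):

```javascript
// Sparse node-ID keys: the array becomes "holey" and, as the thread
// above observes, V8 eventually stores it in dictionary mode
// internally, much like a plain object keyed by the same integers.
const byArray = [];
const byObject = {};

// Simulate large, gappy node IDs as returned by the heap profiler.
const nodeIds = [17, 4096, 250000, 1 << 24];
for (const id of nodeIds) {
  byArray[id] = `node-${id}`;
  byObject[id] = `node-${id}`;
}

// Integer and string keys address the same slot in both cases.
console.log(byArray[4096] === byArray['4096']);   // true
console.log(byObject[4096] === byObject['4096']); // true

// The array's length reflects the highest index, not the element count.
console.log(byArray.length);              // 16777217
console.log(Object.keys(byArray).length); // 4
```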

src/native_ext/memory_profiling.cpp (resolved)
src/native_ext/profiling.cpp (resolved)
src/native_ext/memory_profiling.cpp (resolved)
src/native_ext/memory_profiling.cpp (outdated, resolved)
src/native_ext/profiling.cpp (resolved)
@seemk seemk merged commit ad845c4 into main Sep 16, 2022
@seemk seemk deleted the memory-profiling branch September 16, 2022 12:59
@seemk seemk mentioned this pull request Sep 19, 2022