
Memory usage #310

Closed
carsotho opened this issue Nov 25, 2015 · 20 comments

@carsotho

I am noticing a big increase in memory usage between the releases 3.1.3 and 4.0.0 and above.

Our app is running on node 0.12.6 and we create our elasticsearch client with the following properties:

var elasticsearch = require('elasticsearch');

var esClient = new elasticsearch.Client({
  hosts: 'localhost:9200',
  apiVersion: '1.3',
  minSockets: 10,
  maxSockets: 10
});

When running on 3.1.3 we converge around 350-400 MB. When upgrading to 4.0.0 or any later release, the memory just seems to grow until hitting the V8 limit (1.4 GB).

@spalger
Contributor

spalger commented Nov 25, 2015

Weird, I can't find any change that might cause increased memory usage between 3.1.3 and 4.0. I did update forever-agent between 4.0.0 and 4.0.1 though, which might have caused something. I'll try to reproduce and use git bisect to track down the issue.
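
If anyone wants to compare versions in the meantime, here is a rough sketch of the kind of harness I have in mind (the host, index name and query are placeholders, not a real setup):

// Rough repro sketch (not an actual test): hammer the client with searches
// and log memory usage every few seconds. 'test-index' is a placeholder.
var elasticsearch = require('elasticsearch');

var client = new elasticsearch.Client({ hosts: 'localhost:9200' });

setInterval(function () {
  console.log(new Date().toISOString(), process.memoryUsage());
}, 5000);

(function hammer() {
  client.search({ index: 'test-index', body: { query: { match_all: {} } } })
    .then(function () {
      setImmediate(hammer);       // keep going without growing a promise chain
    }, function (err) {
      console.error(err);
      setTimeout(hammer, 1000);   // back off briefly on errors
    });
})();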

@RWOverdijk

Any leads on this?

@spalger
Contributor

spalger commented Mar 21, 2016

Nope. Memory usage is something we have been trying to tackle over in elastic/kibana, but we haven't had luck reproducing or fixing it. It seems the only real progress was made when Node released updates and ambient memory usage just dropped.

I'm not sure how to approach this, and without solid evidence of an issue it's hard to dedicate time.

@dylancwood

I think I am experiencing the same issue. We are running a relatively thin proxy layer between our mobile clients and Elasticsearch, and each node of that layer receives around 300 requests per minute.
When we upgraded to Node 4+, we ran into max-stack-depth-exceeded errors with elasticsearch-js 3.1.3, so we upgraded to the latest version.

Memory consumption went up about four-fold with just this small change. To confirm that elasticsearch-js is responsible, I forked the repo and fixed the max-stack-depth-exceeded issue by pulling in a change from later versions of elasticsearch-js (https://github.com/dylancwood/elasticsearch-js/blob/3-1-3-speed/src/lib/log.js#L66).

Below are graphs of memory consumption, throughput, and response time for two servers. Both servers are running Node 4.3.0 and were inundated with as many requests as they could handle for about 15 minutes (hence the poor response times). The only difference is that the red one is using elasticsearch-js 4.0.0 and the yellow one is using my patched fork of elasticsearch-js 3.1.3. Note that the response time is actually better on the 4.0.0 version 👍, but 700MB of memory is a high price to pay. Also, I cannot explain why the response time is faster on the 4.0.0 server while the throughput is lower...

(screenshot: memory, throughput and response-time graphs for both servers)

@spalger: I hope you'll consider this to be solid evidence. FTR, we use only three methods of this package: mget, search and indices.analyze.

@dylancwood

Also, @spalger: would you take a PR that patches 3.1.3 to include the EventEmitter.listenerCount updates? Maybe you could create a 3.1.4 branch, cherry-pick the patch into it, then tag and publish?
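
For context, here is a rough sketch of the kind of compatibility shim I mean (illustrative only, not the actual patch in log.js): Node 4 deprecates the static EventEmitter.listenerCount(emitter, event) in favour of the instance method emitter.listenerCount(event).

// Illustrative sketch only, not the actual elasticsearch-js patch.
// Node >= 3.2 provides emitter.listenerCount(event); the old static
// EventEmitter.listenerCount(emitter, event) is deprecated on Node 4+.
var EventEmitter = require('events').EventEmitter;

function countListeners(emitter, event) {
  return typeof emitter.listenerCount === 'function'
    ? emitter.listenerCount(event)                  // modern instance method
    : EventEmitter.listenerCount(emitter, event);   // legacy static fallback
}

// Usage:
var ee = new EventEmitter();
ee.on('data', function () {});
console.log(countListeners(ee, 'data')); // 1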

@spalger
Contributor

spalger commented Apr 2, 2016

Wow, that looks like pretty solid evidence, @dylancwood... I would love to try bisecting to identify where this memory usage spike came from, and I'll release a patch to 3.1 in the meantime. Thank you for the report.

@spalger spalger self-assigned this Apr 2, 2016
@spalger spalger added the bug label Apr 2, 2016
@dylancwood

👍

@RWOverdijk

@dylancwood Thanks for going the extra mile on this! Nice :)

@andrewstoker

andrewstoker commented Jun 21, 2016

@spalger Is this leaky behaviour still being investigated in the newer versions beyond 3.1.4? We are on Node v4.4.5, and with any elasticsearch-js client above 4.x the memory usage is far higher and keeps leaking, much as @dylancwood observed. See the two charts below.

V3.1.4 client:

(memory usage chart)

V11.0.1 client:

(memory usage chart)

Both are running with the following client configuration (host names masked):

elasticsearch: {
    hosts: [
        'http://xxxxx:9200',
        'http://xxxxx:9200',
        'http://xxxxx:9200'
    ],
    minSockets: 10,
    maxSockets: 100
}

@ghost

ghost commented Jul 9, 2016

@spalger

I have yet to organise our application memory stats as neatly as the other posters above, but I believe we are experiencing these exact same issues in our pre-production environment.

Node 5.11.1
ES Client 11.0.1

Currently writing only several entries per second to the ES Index but memory usage keeps accumulating steadily over time until we run out.

Based on the information provided by the other posters, I switched the ES client package from v11.0.1 to v3.1.4 yesterday. Using the same configuration provided below, I can confirm that the v3.1.4 ES client maintains a steady memory usage of around 35-40 MB, while the v11.0.1 ES client is running at around 140 MB. This is after running for approx. 19 hours.

<hostname>:~$ pm2 list
┌────────────────────────────────┬────┬─────────┬──────┬────────┬─────────┬────────┬──────────────┬──────────┐
│ App name                       │ id │ mode    │ pid  │ status │ restart │ uptime │ memory       │ watching │
├────────────────────────────────┼────┼─────────┼──────┼────────┼─────────┼────────┼──────────────┼──────────┤
│ nl_es_stat_update_collector │ 0  │ cluster │ 1637 │ online │ 0       │ 19h    │ 148.605 MB   │ disabled │
│ nl_es_stat_update_collector │ 2  │ cluster │ 1618 │ online │ 0       │ 19h    │ 30.105 MB    │ disabled │
└────────────────────────────────┴────┴─────────┴──────┴────────┴─────────┴────────┴──────────────┴──────────┘

Below is the ES configuration we are using (reading from a Redis channel and writing to an ES index). Note that the v3.1.4 ES client does not support API version 2.3, which is the version of our ES cluster, so I commented that option out.

var elasticsearch = require('elasticsearch');

// Define ES Client variable
var es_client = new elasticsearch.Client({
    //apiVersion: '2.3', // not supported by the 3.1.4 client
    host: '<ip address>:9200',
    keepalive: 'true',
    maxKeepAliveTime: '500',
    maxSockets: 15,
    minSockets: 10,
    sniffOnStart: false,
    log: {
      level: 'info'
    }
});

// Dispatch NL Update to Elasticsearch
// (`sub` is a Redis subscriber and `es_index_date` is a date string, both defined elsewhere in our app)
sub.on('message', function(channel, message) {
    console.log(message);
    es_client.index({
      index: ('nl_stat_update_2016.' + es_index_date),
      type: 'nl_stat_update',
      body: {
        timestamp: new Date(),
        station_update: (JSON.parse(message))
      }
    });
});

As things stand, we are not able to use the v11.0.1 ES client in production. This means we need to stay on the old v3.1.4, which leaves the ES cluster (2.3) and ES client with mismatched API capabilities.
We would very much appreciate your team continuing to research this issue.

Regards,
Reinier

@spalger
Contributor

spalger commented Jul 11, 2016

Thanks for the updated info, folks; keep it coming.

@andrewstoker

More to think about here.
I think the RSS leak we are seeing may be related to the promise library used in the v4 elasticsearch client and above, especially when the ES client is called repeatedly (e.g. in a loop, or in quick succession in response to, say, a RabbitMQ feed). Each get/search/put request in the ES client returns a promise; with the native implementation these promises end up in old memory and are not garbage collected promptly, whereas Bluebird behaves differently and the old memory does get collected.

I know the scripts below are not elastic-client specific, but they are concise enough to show the behaviour of Bluebird versus native promises when promises are returned as part of a loop.

This script will not leak (Bluebird 3.1.1, Node 4.4.7):

var Promise = require('bluebird');

// Resolves on the next turn of the event loop.
function immediate() {
    return new Promise(function(resolve) {
        setImmediate(resolve);
    });
}

// Infinite promise chain; with Bluebird, heap usage stays flat.
(function loop(value) {
    return immediate().then(function() {
        if (value % 10000 === 0) {
            console.log(process.memoryUsage().heapUsed, value);
        }
        return value + 1;
    }).then(loop);
})(0);

This one will leak (using the native Promise implementation):

// Same chain using the native Promise implementation; RSS grows over time.
function immediate() {
    return new Promise(function(resolve) {
        setImmediate(resolve);
    });
}

(function loop(value) {
    return immediate().then(function() {
        if (value % 10000 === 0) {
            console.log(process.memoryUsage().heapUsed, value);
        }
        return value + 1;
    }).then(loop);
})(0);

Can someone validate? Maybe this is something to discuss further with the Bluebird/Node.js communities?

@andrewstoker

andrewstoker commented Jul 14, 2016

I have just realised that Bluebird was not removed from the elastic client until version 10, whereas the RSS leak I see is in v4 and above. So this issue could be something totally different (e.g. in the lodash library, or even in Node.js itself).
The leak is complex to debug: a heap dump trace is not helping much, as the old memory space is collected before the heap snapshot is taken (i.e. a forced GC frees up the memory).

However, RSS leaks in loops of promises (or promises created rapidly, as in my use case) do appear to be a known issue with the native Promise implementation in the V8 engine. Unfortunately it does not look like it can be fixed, as it is a spec issue (https://github.com/nodejs/node/issues/6673).

The general consensus from the internet is to use Bluebird on the server side, where it is far more performant for this kind of workload, and native promises on the client side (for browser compatibility). I haven't tried switching the elastic client back to the Bluebird promise library in the most recent versions yet (that's my next task), but like I said, this could all be a red herring and nothing to do with Bluebird/promises at all.
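
If anyone wants to try that experiment before I do, here is a rough, untested sketch of how I understand the client's defer config option can be used to plug in Bluebird (the option shape is my assumption from the client docs, and the host is a placeholder):

// Rough, untested sketch: swap the client's promise implementation to Bluebird
// via the defer config option described in the client docs.
var elasticsearch = require('elasticsearch');
var Bluebird = require('bluebird');

var client = new elasticsearch.Client({
  host: 'localhost:9200', // placeholder
  // defer() is expected to return an object with { promise, resolve, reject }.
  defer: function () {
    var deferred = {};
    deferred.promise = new Bluebird(function (resolve, reject) {
      deferred.resolve = resolve;
      deferred.reject = reject;
    });
    return deferred;
  }
});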

However, there are workarounds for the RSS leak that I should note here. They are a combination of the following settings (try one, then the other, or both):

  • Set the node parameter --max-old-space-size to a value suitable for your environment (especially for cloud-based instances).
  • Set the node parameter --expose-gc and, within your code, manually garbage collect on a regular schedule with global.gc() (stick it in a setTimeout/setInterval at start-up of your app), as sketched below.

The issue with doing this is that it forces a garbage collection outside of what the V8 engine wants to do, which will momentarily pause your application, so test thoroughly.
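
For reference, a minimal sketch of the second workaround, using setInterval rather than a recursive setTimeout (the 30-second interval and 512 MB limit are arbitrary example values, not recommendations):

// Start the process with both flags, e.g.:
//   node --expose-gc --max-old-space-size=512 app.js
// global.gc only exists when --expose-gc is set.
if (typeof global.gc === 'function') {
  setInterval(function () {
    global.gc();
    console.log('manual GC, heapUsed:', process.memoryUsage().heapUsed);
  }, 30000);
} else {
  console.warn('Run node with --expose-gc to enable manual garbage collection');
}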

@marcosnils

As a note, I've experienced the same problem using v11 in AWS Lambda with Node 4.3. I downgraded the client to 3.1.4 and the issue seems to be fixed.

@ghost

ghost commented Sep 13, 2016

It's been some time since I posted to this thread, but I would like to add to my previous comment that we have since moved from Node 5.11.1 to Node 4.3 and our environment is now in production using the 3.1.4 client. The issue persists using the v11 client.

@andrewstoker

Unfortunately this comes down to Node.js and its lazy garbage collection of old space, rather than to the elasticsearch client's behaviour.

Have you used the node parameter --max-old-space-size in your application? It's what is used under the covers in other areas of Elasticsearch to help with garbage collection of the old memory space (e.g. Kibana was affected; details in elastic/kibana#5595).

I have this set in production in our application (node 4.3) and have had no memory issues with the most recent elasticsearch client (v12) since.

@marcosnils

Unfortunately this comes down to Node.js and its lazy garbage collection of old space, rather than to the elasticsearch client's behaviour.

I don't think this is the case. I just replaced the ES client, without any Node configuration or version change, and the problem disappeared.

@andrewstoker

Yes, I agree that there is an issue, given that the problem goes away when you use an old version of the client. I did spend a lot of time drilling down into the various libraries that have been updated since the v3 elastic client; however, I found myself getting ever deeper with no solution or cause for the memory-profile change in sight.
In my use case, the common and well-documented node parameter plus the benefits of the new client eventually outweighed spending more of my time looking at this.
Maybe @spalger or others on this issue can look into it more deeply if the node parameter really is not an option for your use case.

@ghost

ghost commented Sep 13, 2016

@andrewstoker Well-chosen Node configuration flags might well provide relief in combination with the latest client, but as you said earlier, it depends on the use case. I have not yet checked what the impact of your suggestions would be on my particular use case; they might be a solution of sorts. Having said that, for clients beyond v3.1.4 up to and including v11 to have such a negative memory side effect with a standard Node config and a very simple, (currently) low-intensity production setup like my own does not seem quite right.

Anyways, thanks for your research and suggestions.

@cblanc cblanc mentioned this issue Dec 22, 2016
@delvedor
Member

delvedor commented Apr 1, 2019

Hello! We have released the new JavaScript client! 🎉
You can install it with npm i @elastic/elasticsearch; give it a try! :)
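
A minimal usage sketch, assuming a default local node (the index name and query are just placeholders):

// Minimal sketch with the new client; 'my-index' and the query are placeholders.
const { Client } = require('@elastic/elasticsearch');
const client = new Client({ node: 'http://localhost:9200' });

client
  .search({ index: 'my-index', body: { query: { match_all: {} } } })
  .then(function (response) {
    // The HTTP payload is on response.body in the new client.
    console.log(response.body.hits.hits);
  })
  .catch(console.error);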

@delvedor delvedor closed this as completed Apr 1, 2019