Fleet managed: enable debug mode #143

mtojek · 2021-10-13T11:24:30Z

Hi Team,

while investigating the root cause of elastic/integrations#1566 , we confirmed that's really useful to enable debug logs for filebeat and metricbeat running as Docker containers (under CI). The ideal option would involve an extra property in policy to enable debug logging or at least a special Docker image ENV which can be hardcoded in system tests.

elasticmachine · 2021-10-13T11:24:40Z

Pinging @elastic/elastic-agent-control-plane (Team:Elastic-Agent-Control-Plane)

mtojek · 2021-10-27T07:08:47Z

Hey! Is there an option to give some priority? It might be a useful feature for supporting customers once we're GA. Any workaround would be also appreciated.

jlind23 · 2021-10-28T16:04:57Z

Assigning @michel-laterman to have more insight as he worked on the diagnosis topic. As soon as we have a clear target on it we'll move forward.
(fyi @nimarezainia )

michel-laterman · 2021-10-28T22:33:16Z

We can pass the logging.level setting to beats/processes

jlind23 · 2021-10-29T07:41:44Z

Should we add it in the same PR as ECS logging? elastic/beats#28573
wdyt @michel-laterman

mtojek · 2021-10-29T08:20:51Z

We can pass the logging.level setting to beats/processes

Sounds good, but to be complete we need to control the log level with ENVs injected to the Docker image. Is this option available?

michel-laterman · 2021-10-29T16:32:25Z

There's some support for env vars now (https://www.elastic.co/guide/en/beats/metricbeat/7.15/using-environ-vars.html) however it needs changes made to the config file used in order to work, is that acceptable for the time being?

mtojek · 2021-11-02T07:01:22Z

This is the way we start elastic-agent and fleet-server:
https://github.com/elastic/elastic-package/blob/9e5e39f0bfcdd0d92439384b4e86836da47b1391/internal/profile/_static/docker-compose-stack.yml#L94

We depend on environment variables only.

michel-laterman · 2021-11-03T18:30:43Z

After a chat with @ruflin, I think that we agreed the best approach to take was to add the ability to define logging levels to the policy instead of just passing the agent's level to the process. This way we can use environment variables with a default, for example (syntax may not be correct)

log.level: ${CUSTOM_LOG_LEVEL:'info'}

ruflin · 2021-11-04T13:19:37Z

@joshdover I wonder if we should use an environment variable with the preset values as the default from Fleet. This means, the policy sent down by Fleet would always contain something like log.level: ${env.ELASTIC_AGENT_LOG_LEVEL|'error'} in the case of error (syntax may also not be correct).

Now what happens if a user was overwriting the log level for an elastic agent from Fleet? Which one wins?

joshdover · 2021-11-15T14:26:44Z

I wonder if we should use an environment variable with the preset values as the default from Fleet. This means, the policy sent down by Fleet would always contain something like log.level: ${env.ELASTIC_AGENT_LOG_LEVEL|'error'} in the case of error (syntax may also not be correct).

Seems reasonable to me. Let us know if we should open an issue to start tracking this.

Now what happens if a user was overwriting the log level for an elastic agent from Fleet? Which one wins?

Do we have precedence with any other settings? I sorta expect the local agent config to override the managed config for simple debugging purposes. I don't think there would be any security risk here since the user needs root access on the machine to edit the local configuration, which means they likely have access to everything that Agent is collecting data from.

jlind23 · 2021-11-15T15:38:09Z

Now what happens if a user was overwriting the log level for an elastic agent from Fleet? Which one wins?

Do we need to always have the same winner? Shouldn't we take the most verbose for diagnosis purpose?

ruflin · 2021-11-16T08:59:20Z

@joshdover Sounds like this is the issue to track this? You mean we need an additional issue in Beats / Kibana?

@jlind23 I expect us to have many more configs where this logic applies. Instead of having a case by case logic I rather have a principle in place do always have the same behaviour.

I like the idea that the most local one wins. Basically the order would be:

local config > fleet config > template default

jlind23 · 2021-11-16T09:25:16Z

Local first suits me well then! 👍🏼

joshdover · 2022-01-05T13:08:03Z

Sounds like this is the issue to track this? You mean we need an additional issue in Beats / Kibana?

Yes, I was asking if we should open an Fleet UI issue for making this the default value: log.level: ${env.ELASTIC_AGENT_LOG_LEVEL|'error'}. But given the precedence discussion, should this just always be the behavior from the Agent side rather than requiring this in the configuration yaml directly?

ruflin · 2022-01-10T09:50:33Z

I think we need both. Making it a default in the Elastic Agent but also when shipped down from the policy as the policy will overwrite it.

axw · 2022-02-17T10:15:22Z

Being able to enable debug logging is important for APM Server too, as we are going all in on Fleet. If users can't enable debug logging we're going to have a harder time debugging issues.

Ideally we would also be able to set logging.selectors, as turning on debug level logging for everything tends to be overwhelming.

joshdover · 2022-02-17T15:42:46Z

I've opened elastic/kibana#125956 to track this on the Fleet UI side.

Following from @axw's suggestion above, it seems we're likely to want to expose more than just logging.level to inputs. Instead of adding an env var for each one, should we instead extend the agent context provider to allow inputs to read the overall agent logging configuration? This would allow a policy like:

id: my-policy
agent:
  monitoring:
    # ...
outputs:
  # ...
fleet.hosts: ''
inputs:
  - id: <uuid>
    type: logfile
    logging:
      level: "${agent.logging.level | 'error'}"
      selectors: "${agent.logging.selectors | '[beat]'}"

TBH I'm still unclear on why Fleet needs to provide this in the policy and the default can't be part of the Elastic Agent logic. What wouldn't be possible if the default came from the policy? The only thing I can think of is whenever we get around to adding support for variables and conditions or global variables that this would be necessary.

jlind23 · 2022-02-21T14:04:41Z

@ph would it be possible for you to give a first stab as designing it?

ph · 2022-02-22T21:31:51Z

I need bit more details, looking at the original description and the above information we are looking for a way to define using the environment variable a new log level or a new selector or both. Looking at this comment we are looking at log level per input?

What is the actual need per input or that we can specify the log level and at the global of the agent policy? I am asking this for a few reasons:

I am not sure yet how logging will work with v2 input, so adding a new field that we would need to support make be a bit uneasy.
What happens when there are multiple inputs for logs that define different log level or even different selectors?

jlind23 · 2022-03-07T14:58:33Z

If there are inputs with different log levels then I think we should take the most verbose one.

oren-zohar · 2022-04-28T09:31:18Z

The debug logging option is also crucial for cloudbeat. We are less concerned about log level per input and even a log level that's inherited from the agent log level / env var would be helpful. Maybe we can start with that and reiterate once we have a complete definition?
I would happy to help with implementing some basic way of controlling the log level to speed things up.

jlind23 · 2022-04-28T09:47:26Z

@oren-zohar starting with a global log level sounds good as a first step. If you give a first try at it let me know and i'll find someone to help you out if needed.

jlind23 · 2024-05-14T07:06:37Z

Closing this as duplicate as it will be covered by #3090
cc @ycombinator

mtojek added the Team:Elastic-Agent-Control-Plane Label for the Agent Control Plane team label Oct 13, 2021

jlind23 assigned michel-laterman Oct 28, 2021

michel-laterman mentioned this issue Oct 28, 2021

Pass logging.level from elastic agent to processes. elastic/beats#28707

Closed

6 tasks

jlind23 added the good first issue Good for newcomers label Nov 2, 2021

jlind23 added 8.1-candidate v8.1.0 and removed v8.1.0 labels Nov 9, 2021

jlind23 added v8.1.0 and removed 8.1-candidate labels Dec 7, 2021

jlind23 unassigned michel-laterman Dec 13, 2021

jlind23 added 8.3-candidate and removed v8.1.0 labels Jan 17, 2022

joshdover mentioned this issue Feb 17, 2022

[Fleet] Add support for input logging configuration to Agent Policies elastic/kibana#125956

Open

jlind23 assigned ph Feb 21, 2022

jlind23 transferred this issue from elastic/beats Mar 7, 2022

jlind23 added v8.3.0 and removed 8.3-candidate labels Mar 23, 2022

jlind23 removed the v8.3.0 label May 18, 2022

renzedj mentioned this issue Sep 8, 2022

Set default log level for gathering Elastic Agent logs for all agents in policy #1129

Closed

jlind23 closed this as not planned Won't fix, can't repro, duplicate, stale May 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fleet managed: enable debug mode #143

Fleet managed: enable debug mode #143

mtojek commented Oct 13, 2021

elasticmachine commented Oct 13, 2021

mtojek commented Oct 27, 2021 •

edited

Loading

jlind23 commented Oct 28, 2021

michel-laterman commented Oct 28, 2021

jlind23 commented Oct 29, 2021 •

edited

Loading

mtojek commented Oct 29, 2021

michel-laterman commented Oct 29, 2021

mtojek commented Nov 2, 2021

michel-laterman commented Nov 3, 2021

ruflin commented Nov 4, 2021

joshdover commented Nov 15, 2021

jlind23 commented Nov 15, 2021 •

edited

Loading

ruflin commented Nov 16, 2021

jlind23 commented Nov 16, 2021

joshdover commented Jan 5, 2022

ruflin commented Jan 10, 2022

axw commented Feb 17, 2022

joshdover commented Feb 17, 2022

jlind23 commented Feb 21, 2022

ph commented Feb 22, 2022

jlind23 commented Mar 7, 2022

oren-zohar commented Apr 28, 2022

jlind23 commented Apr 28, 2022

jlind23 commented May 14, 2024

Fleet managed: enable debug mode #143

Fleet managed: enable debug mode #143

Comments

mtojek commented Oct 13, 2021

elasticmachine commented Oct 13, 2021

mtojek commented Oct 27, 2021 • edited Loading

jlind23 commented Oct 28, 2021

michel-laterman commented Oct 28, 2021

jlind23 commented Oct 29, 2021 • edited Loading

mtojek commented Oct 29, 2021

michel-laterman commented Oct 29, 2021

mtojek commented Nov 2, 2021

michel-laterman commented Nov 3, 2021

ruflin commented Nov 4, 2021

joshdover commented Nov 15, 2021

jlind23 commented Nov 15, 2021 • edited Loading

ruflin commented Nov 16, 2021

jlind23 commented Nov 16, 2021

joshdover commented Jan 5, 2022

ruflin commented Jan 10, 2022

axw commented Feb 17, 2022

joshdover commented Feb 17, 2022

jlind23 commented Feb 21, 2022

ph commented Feb 22, 2022

jlind23 commented Mar 7, 2022

oren-zohar commented Apr 28, 2022

jlind23 commented Apr 28, 2022

jlind23 commented May 14, 2024

mtojek commented Oct 27, 2021 •

edited

Loading

jlind23 commented Oct 29, 2021 •

edited

Loading

jlind23 commented Nov 15, 2021 •

edited

Loading