Support log level setting from policy #3090
Conversation
This pull request does not have a backport label. Could you fix it @pchila? 🙏
💔 Tests Failed
This pull request is now in conflicts. Could you fix it? 🙏
Force-pushed from 0275469 to 86c740f
Force-pushed from bfe26f2 to 50f8bf6
Force-pushed from 4fd0d9c to a045482
Force-pushed from 30fde5b to d512cab
Pinging @elastic/elastic-agent-control-plane (Team:Elastic-Agent-Control-Plane)
Overall this looks really good. Easy to follow and very well tested.
Would like to see an integration test covering all the different modes of operation. The control protocol should expose the currently set log level, so it should be easy to validate the logging information.
```diff
@@ -570,13 +570,16 @@ func (c *Coordinator) PerformComponentDiagnostics(ctx context.Context, additiona
 // SetLogLevel changes the entire log level for the running Elastic Agent.
 // Called from external goroutines.
-func (c *Coordinator) SetLogLevel(ctx context.Context, lvl logp.Level) error {
+func (c *Coordinator) SetLogLevel(ctx context.Context, lvl *logp.Level) error {
```
Why change this to allow a nil value, to then error if a nil value is passed?
You also convert the pointer to value in the function. Seems to me all the changes in this function could be removed and it would have the same result.
Mostly it is to have a single interface for the log level setter and leave the check to the implementations...
If I didn't change this signature I would have to define 2 interfaces: one taking a pointer (for setting the log level coming from policy, which can be cleared) and one taking a value, implemented by the coordinator (which needs to receive a real value).
The signal-to-noise ratio of having 2 interfaces with similar names differing only by a pointer seemed a bit much to me, but I still wanted to abstract away the coordinator package, so this is the tradeoff I came to.
There's no technical reason why we could not have 2 separate interfaces though...
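For illustration, a minimal sketch of the single-setter-interface approach described above; the package, interface name, and comments are assumptions, not the actual code from this PR:

```go
package handlers

import (
	"context"

	"github.com/elastic/elastic-agent-libs/logp"
)

// logLevelSetter is the single abstraction the action handlers could depend
// on (illustrative name). A nil level means "no explicit level for this
// agent, fall back to the policy or default"; each implementation decides
// how to handle nil, including returning an error if it needs a real value.
type logLevelSetter interface {
	SetLogLevel(ctx context.Context, lvl *logp.Level) error
}
```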
No that makes complete sense. I was just confused on the change, but if that unifies it I prefer that.
@blakerouse I am currently implementing integration tests using `inspect`.
You might not need to use `inspect`. Would be easier to use the status output of the control protocol that should include the `agent_info` that has the current log level.
Force-pushed from 7bc8dd2 to b9394d4
Depending on how the AgentInfo object is created, it may not contain the actual log level: it could be empty in the config/state file but set via policy at runtime when we process the policy_change action... I implemented the integration test with `inspect` and it seems to work pretty well.
> You might not need to use `inspect`. Would be easier to use the status output of the control protocol that should include the `agent_info` that has the current log level.

> Depending on how the AgentInfo object is created it may not contain the actual log level as it could be empty in the config/state file but it could be set via policy at runtime when we process the policy_change action... I implemented the integration test with `inspect` and it seems to work pretty well

That is not good. We should ensure that the output from `elastic-agent status --output=yaml` includes the current active log level. It should not be obscure to the user, it should be very clear. Even if the log level is not set in the agent info context, the output from the `status` command should show the active log level.
Quality Gate passed
Had a quick look at the `State()` handler in the control server:

```go
// State returns the overall state of the agent.
func (s *Server) State(_ context.Context, _ *cproto.Empty) (*cproto.StateResponse, error) {
	state := s.coord.State()
	return stateToProto(&state, s.agentInfo)
}
```

and then in the stateToProto() function:

```go
func stateToProto(state *coordinator.State, agentInfo info.Agent) (*cproto.StateResponse, error) {
	// some code omitted here...
	return &cproto.StateResponse{
		Info: &cproto.StateAgentInfo{
			Id:           agentInfo.AgentID(),
			Version:      release.Version(),
			Commit:       release.Commit(),
			BuildTime:    release.BuildTime().Format(control.TimeFormat()),
			Snapshot:     release.Snapshot(),
			Pid:          int32(os.Getpid()),
			Unprivileged: agentInfo.Unprivileged(),
		},
		State:          state.State,
		Message:        state.Message,
		FleetState:     state.FleetState,
		FleetMessage:   state.FleetMessage,
		Components:     components,
		UpgradeDetails: upgradeDetails,
	}, nil
}
```

It turns out that the log level is not part of the `cproto.StateAgentInfo` we send back. To have the correct representation, if we decide to add the log level to the `status` output, we would need a follow-up change.
My bad! I was thinking it was there in the output because I know some of the log level state is stored in the same interface as the agent information.
Thanks for explaining it.
‼️ Should be reverted if elastic/elastic-agent#4747 does not make 8.15.0.

## Summary

Resolves #180778

This PR allows agent log level to be reset back to the level set on its policy (or if not set, simply the default agent level, see elastic/elastic-agent#3090). To achieve this, this PR:

- Allows `null` to be passed for the log level settings action, i.e.:
  ```
  POST kbn:/api/fleet/agents/<AGENT_ID>/actions
  {"action":{"type":"SETTINGS","data":{"log_level": null}}}
  ```
- Enables the agent policy log level setting implemented in #180607
- Always show `Apply changes` on the agent details > Logs tab
- For agents >= 8.15.0, always show `Reset to policy` on the agent details > Logs tab
- Ensures both buttons are disabled if user does not have access to write to agents

<img width="1254" alt="image" src="https://github.com/elastic/kibana/assets/1965714/bcdf763e-2053-4071-9aa8-8bcb57b8fee1">
<img width="1267" alt="image" src="https://github.com/elastic/kibana/assets/1965714/182ac54d-d5ad-435f-9376-70bb24f288f9">

### Caveats

1. The reported agent log level is not accurate if the agent is using the level from its policy and does not have a log level set on its own level (elastic/elastic-agent#4747), so the initial selection on the agent log level could be wrong
2. We have no way to tell where the log level came from (elastic/elastic-agent#4748), so that's why `Apply changes` and `Reset to policy` are always shown

### Testing

Use the latest `8.15.0-SNAPSHOT` for agents or fleet server to test this change

### Checklist

Delete any items that are not applicable to this PR.

- [x] Any text added follows [EUI's writing guidelines](https://elastic.github.io/eui/#/guidelines/writing), uses sentence case text and includes [i18n support](https://github.com/elastic/kibana/blob/main/packages/kbn-i18n/README.md)
- [x] [Unit or functional tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html) were updated or added to match the most common scenarios
What does this PR do?
Support setting elastic-agent log level from fleet policy.
The agent will apply (in decreasing order of priority):

1. the log level set for the specific agent via a `settings` action, if any
2. the log level set in the agent policy, if any
3. the default agent log level

Whenever a `policy_change` or `settings` action is received, the settings action handler will reevaluate the log levels specified and set the log level according to the priority above.
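As a rough illustration of that priority order (not this PR's actual implementation; the helper name and the hard-coded default are assumptions):

```go
package main

import (
	"fmt"

	"github.com/elastic/elastic-agent-libs/logp"
)

// resolveLogLevel applies the priority described above:
// per-agent settings action > policy level > default agent level.
func resolveLogLevel(actionLevel, policyLevel *logp.Level) logp.Level {
	if actionLevel != nil {
		return *actionLevel // 1. level set for this specific agent via a settings action
	}
	if policyLevel != nil {
		return *policyLevel // 2. level set in the fleet policy
	}
	return logp.InfoLevel // 3. assumed default agent level
}

func main() {
	policyLevel := logp.DebugLevel
	// No per-agent override: the policy level wins.
	fmt.Println(resolveLogLevel(nil, &policyLevel))
}
```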
Why is it important?
It allows users to manage elastic-agent verbosity easily through the fleet policy, while at the same time allowing them to set a different log level for specific agents when troubleshooting issues.
Checklist
- [ ] I have made corresponding changes to the documentation
- [ ] I have made corresponding changes to the default configuration files
- [ ] I have added tests that prove my fix is effective or that my feature works
- [ ] I have added an entry in `./changelog/fragments` using the changelog tool
- [ ] I have added an integration test or an E2E test

Author's Checklist
How to test this PR locally
Create a simple policy in fleet and enroll an elastic-agent with that policy.
Then start setting log levels in the policy and for the specific agent using dev tools and the requests below.
Set policy log level
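The original request body was not preserved here; a plausible shape uses Fleet's agent policy update API in dev tools, where the `advanced_settings.agent_logging_level` field name is an assumption:

```
PUT kbn:/api/fleet/agent_policies/<POLICY_ID>
{
  "name": "my-policy",
  "namespace": "default",
  "advanced_settings": {
    "agent_logging_level": "debug"
  }
}
```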
Set log level for a specific agent
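This request body was also lost; based on the settings action shown in the linked Kibana PR description above, it looks like the following (send `null` instead of `"debug"` to clear the per-agent override):

```
POST kbn:/api/fleet/agents/<AGENT_ID>/actions
{"action":{"type":"SETTINGS","data":{"log_level":"debug"}}}
```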
A good way to check the effect of the log level changes is to keep a terminal with the `elastic-agent logs -f` command running: that way we can see the changes in elastic-agent logging in real time.

Another way to check the log level currently in use by the agent is the `inspect` subcommand (grepping or using `yq` is recommended as the output is very verbose), for example:

```
sudo elastic-agent inspect | yq .agent
```
Related issues
Use cases
Screenshots
Logs
Questions to ask yourself