Open Source Log Management Solutions

Abstract

Currently there are multiple open-source tools intended for the collection, storage and representation of text-based logs. In order to make informed long-term decisions on monitoring solutions, engineers need to understand the differences in feature set, hardware requirements and level of support of those tools.

It is also crucial to understand the inter-dependencies between log processing components. In order to design a finely tailored log management system, engineers need to be familiar with existing open-source logging components, their capabilities and operational requirements.

The purpose of this research is to analyze the differences between several well-known Log Management solutions and present the results in a structured way. The results of the investigation should help technical experts pick an appropriate solution that meets their specific requirements.

It should be noted that only free Log Management solutions and their free features were reviewed. The study contains sections that describe the differences between tools of similar function.

Log Management Systems

Logs can be described as free-form comments produced by software during its execution. In order to process and analyze them efficiently, various open-source tools have been created.

A Log Management System is a combination of such tools, with the core responsibilities listed below:

capture of logs from applications (belongs to "Collection" functionality)
storage of log data (belongs to "Storage" functionality)
search/analysis of log data (belongs to "Representation" functionality)
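
The three responsibilities above can be sketched as a toy in-memory pipeline. This is only an illustration of how the roles divide; the names `LogStore` and `collect` are hypothetical and do not correspond to any of the reviewed tools.

```python
from dataclasses import dataclass, field


@dataclass
class LogStore:
    """Storage: keeps log lines in memory (stand-in for a real storage service)."""
    lines: list = field(default_factory=list)

    def append(self, line: str) -> None:
        self.lines.append(line)

    def search(self, term: str) -> list:
        """Representation: naive case-insensitive substring search."""
        needle = term.lower()
        return [line for line in self.lines if needle in line.lower()]


def collect(raw_output: str, store: LogStore) -> int:
    """Collection: split application output into lines and hand them to storage."""
    count = 0
    for line in raw_output.splitlines():
        stripped = line.strip()
        if stripped:  # skip empty lines
            store.append(stripped)
            count += 1
    return count


store = LogStore()
collect("INFO service started\n\nERROR disk full\n", store)
print(store.search("error"))
```

In a real system each role is a separate service, which is why the compatibility between them matters.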

A more detailed definition and use cases of Log Management Systems can be found in the Sematext Log Management guide and the Graylog Log Management guide.

Structure of the Research

In order to visualize inter-dependencies between logging components we've created a compatibility map. Components are organized according to their function: collection, storage and representation.

The dependency graph shows that the most important choice is the storage service, because it determines which collection and representation tools can be reliably integrated into the system.

We define three component stacks, clustered around storage services: Solr, Elasticsearch and Loki.

The review of the stacks is followed by a description of individual logging components, grouped by their responsibility: collection, storage or representation.

Tools Compatibility Map

This section presents the inter-dependencies between the most popular open-source logging components.

Tools are grouped by responsibility: collection, storage or representation.

An outbound arrow represents a dependency on, or the ability to be integrated with, another component. Multiple outbound arrows represent alternatives that could be used when building a Log Management System.

For instance, Fluentd can send collected logs to Elasticsearch, Grafana Loki or to Logstash for further processing and rerouting.

A Log Management System is expected to have one representation tool, one storage tool and at least one collection tool.
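
A compatibility map of this kind can be modelled as a small directed graph. The sketch below is an assumption-laden illustration: it encodes only the edges that are stated in the text of this document (the actual map contains more), and the helper names `reaches` and `validate` are hypothetical.

```python
# Edges taken from statements in this document; the real map contains more.
SENDS_TO = {
    "Fluentd": {"Elasticsearch", "Grafana Loki", "Logstash"},
    "Beats": {"Elasticsearch", "Logstash"},
    "Logstash": {"Elasticsearch"},
    "Promtail": {"Grafana Loki"},
}
READS_FROM = {
    "Kibana": {"Elasticsearch"},
    "Grafana Explore": {"Grafana Loki"},
    "Banana": {"Solr"},
}
STORAGE = {"Elasticsearch", "Solr", "Grafana Loki"}


def reaches(collector, storage, seen=None):
    """True if logs from `collector` can arrive at `storage`, directly or via another collector."""
    seen = seen or set()
    if collector in seen:  # guard against cycles
        return False
    seen.add(collector)
    targets = SENDS_TO.get(collector, set())
    if storage in targets:
        return True
    return any(reaches(t, storage, seen) for t in targets if t in SENDS_TO)


def validate(collectors, storage, representation):
    """Return a list of problems; an empty list means the combination is consistent."""
    problems = []
    if storage not in STORAGE:
        problems.append(f"{storage} is not a known storage service")
    if not collectors:
        problems.append("at least one collection tool is required")
    for c in collectors:
        if not reaches(c, storage):
            problems.append(f"{c} cannot deliver logs to {storage}")
    if storage not in READS_FROM.get(representation, set()):
        problems.append(f"{representation} cannot read from {storage}")
    return problems


print(validate(["Promtail"], "Elasticsearch", "Kibana"))
```

Starting the check from the storage service mirrors the observation that storage is the choice that constrains everything else.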

Stacks Overview

Services that are responsible for log collection and representation usually depend on a specific log storage service. Therefore, it is convenient to group them into solutions (stacks), differentiated by compatibility with a particular log storage service:

Solr stack
Elastic stack
Grafana Loki stack

Solr Stack

Unfortunately, the open-source log representation solutions that are able to work with Solr have a set of critical drawbacks:

Banana has low contributor activity (~20 commits during the last two years), which means that it could be difficult to get new features and bug fixes (see the GitHub contributors chart).
Cloudera Hue is able to work with Solr only via its SQL interface, preventing users from utilizing the full-text search capabilities of the engine (see the Hue documentation).

Solr is quite close to Elasticsearch in terms of feature set (for open-source versions) and performance (see Mustafa, 2016), as both are based on Lucene.

Elastic Stack

Tooling in this stack is mostly represented by solutions from Elastic.

Solutions from Elastic are well maintained and cover most of the log sources (see Beats and Logstash).

Elasticsearch is a mature full-text search engine widely adopted by the industry.

Grafana Stack

Grafana Loki is a relatively new product: the first beta was released on Jun 4, 2019, and version 1.0.0 was released on Nov 20, 2019.

Because the product is so new, there are not many compatible log collection tools. At the time of writing (Dec 19, 2019), these are Fluentd and Promtail.

There is also a Docker driver that is capable of routing logs directly to a Loki instance.

Stacks Summary

Unfortunately, the lack of high-quality representation tools renders the Solr stack unfit for modern production-ready systems.

The Grafana stack is a new, promising player in the field of log processing. It may be considered a viable alternative to Elastic-based solutions.

Storage Services Review

This section contains descriptions of services that are responsible for storing/indexing logs.

Elasticsearch

Elasticsearch is a search engine based on the Lucene library. It provides a distributed, multitenant-capable full-text search engine with an HTTP web interface and schema-free JSON documents. Elasticsearch is developed in Java.
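
As an illustration of the HTTP interface and schema-free JSON documents, the sketch below prepares a request for the Elasticsearch `_bulk` endpoint, which accepts newline-delimited JSON. The index name `app-logs` and the address `http://localhost:9200` are assumptions made for the example, and the action line shown assumes a version that does not require a document type. The request is only constructed here, not sent.

```python
import json
import urllib.request


def bulk_body(index: str, events: list) -> bytes:
    """Build a newline-delimited body: one action line followed by one document line per event."""
    lines = []
    for event in events:
        lines.append(json.dumps({"index": {"_index": index}}))
        lines.append(json.dumps(event))
    # The bulk body must end with a newline.
    return ("\n".join(lines) + "\n").encode("utf-8")


def bulk_request(base_url: str, index: str, events: list) -> urllib.request.Request:
    """Prepare (but do not send) the HTTP request."""
    return urllib.request.Request(
        url=f"{base_url}/_bulk",
        data=bulk_body(index, events),
        headers={"Content-Type": "application/x-ndjson"},
        method="POST",
    )


# Documents do not need to share a schema.
events = [
    {"message": "service started"},
    {"level": "ERROR", "message": "disk full", "code": 7},
]
request = bulk_request("http://localhost:9200", "app-logs", events)
print(request.full_url)
```

Passing `request` to `urllib.request.urlopen` would submit it to a running cluster.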

Solr

Solr is a fast open source search platform built on Apache Lucene™ that provides scalable indexing and search, as well as faceting, hit highlighting and advanced analysis/tokenization capabilities. Solr and Lucene are managed by the Apache Software Foundation.

Grafana Loki

Loki is a horizontally-scalable, highly-available, multi-tenant log aggregation system inspired by Prometheus. It is designed to be very cost effective and easy to operate. It does not index the contents of the logs, but rather a set of labels for each log stream.

Compared to other log aggregation systems, Loki:

does not do full text indexing on logs. By storing compressed, unstructured logs and only indexing metadata, Loki is simpler to operate and cheaper to run.
indexes and groups log streams using the same labels you’re already using with Prometheus, enabling you to seamlessly switch between metrics and logs using the same labels that you’re already using with Prometheus.
is an especially good fit for storing Kubernetes Pod logs. Metadata such as Pod labels is automatically scraped and indexed.
has native support in Grafana (needs Grafana v6.0).
https://github.com/grafana/loki/blob/master/README.md
https://github.com/grafana/loki/tree/master/docs
https://github.com/grafana/loki/tree/master/production/helm
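
The difference between full-text indexing and label-only indexing can be illustrated with two toy classes. This is a conceptual sketch only, not how Lucene or Loki are implemented; the class names `FullTextIndex` and `LabelIndex` are hypothetical.

```python
from collections import defaultdict


class FullTextIndex:
    """Inverted index: every token of every line is indexed (Lucene-style idea)."""

    def __init__(self):
        self.lines = []
        self.postings = defaultdict(set)  # token -> ids of lines containing it

    def add(self, labels, line):
        line_id = len(self.lines)
        self.lines.append(line)
        for token in line.lower().split():
            self.postings[token].add(line_id)

    def search(self, token):
        return [self.lines[i] for i in sorted(self.postings.get(token.lower(), ()))]


class LabelIndex:
    """Only label sets are indexed; line contents are scanned at query time (Loki-style idea)."""

    def __init__(self):
        self.streams = defaultdict(list)  # frozenset of (label, value) pairs -> lines

    def add(self, labels, line):
        self.streams[frozenset(labels.items())].append(line)

    def search(self, selector, contains=""):
        wanted = set(selector.items())
        hits = []
        for stream_labels, lines in self.streams.items():
            if wanted <= stream_labels:  # stream carries all requested labels
                hits.extend(line for line in lines if contains in line)
        return hits


full_text, by_label = FullTextIndex(), LabelIndex()
sample = [
    ({"app": "api", "env": "prod"}, "request failed timeout"),
    ({"app": "api", "env": "prod"}, "request ok"),
    ({"app": "db", "env": "prod"}, "timeout waiting for lock"),
]
for labels, line in sample:
    full_text.add(labels, line)
    by_label.add(labels, line)

print(len(full_text.postings), "index entries vs", len(by_label.streams))
```

The label index stays small regardless of how much text is logged, at the cost of scanning line contents when a query filters on text.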

Core Differences

	Elasticsearch	Solr	Grafana Loki
Search Engine	Lucene	Lucene	Loki
Orchestration	-	Zookeeper	Consul
Storage (Cluster Setup)	File System	File System	Index storage: Amazon DynamoDB, Google BigTable, Apache Cassandra; blob storage: Amazon DynamoDB, Google BigTable, Apache Cassandra, S3, Google Cloud Storage

Collection Services Review

This section contains descriptions of services that are responsible for collecting, parsing and re-routing logs to log storage.

Logstash

Logstash is part of the Elastic Stack along with Beats, Elasticsearch and Kibana. Logstash is a server-side data processing service that ingests data from a multitude of sources, transforms it, and then sends it to one of many destinations.

Logstash has more than 200 plugins.

Beats

The Beats are lightweight data shippers, written in Go, that can be installed on servers to capture all sorts of operational data (logs, metrics, or network packet data). The Beats send the operational data to Elasticsearch, either directly or via Logstash.

Fluentd

Fluentd collects events from various data sources and writes them to files, RDBMS, NoSQL, IaaS, SaaS, Hadoop and so on. Fluentd helps you unify your logging infrastructure (learn more about the Unified Logging Layer).

Promtail

Promtail's mode of operation is to discover log files stored on disk and forward them associated with a set of labels to Loki. Promtail can do service discovery for Kubernetes pods running on the same node as Promtail, act as a container sidecar or a Docker logging driver, read logs from specified folders, and tail the systemd journal.
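
The idea of forwarding log lines together with a set of labels can be sketched by building the JSON body that Loki's HTTP push endpoint accepts. This is not Promtail's implementation. It assumes a Loki version that exposes `/loki/api/v1/push` with a `streams`/`values` body, and the function name `loki_push_payload` and the label values are hypothetical.

```python
import json
import time


def loki_push_payload(labels: dict, lines: list, now_ns=None) -> str:
    """Build a push body: one stream identified by its labels, each line with a nanosecond timestamp."""
    base = time.time_ns() if now_ns is None else now_ns
    # Timestamps are sent as strings; the offset keeps lines in order.
    values = [[str(base + offset), line] for offset, line in enumerate(lines)]
    return json.dumps({"streams": [{"stream": labels, "values": values}]})


body = loki_push_payload(
    {"job": "varlogs", "filename": "/var/log/app.log"},
    ["service started", "listening on :8080"],
)
print(body)
```

Only the labels (`job`, `filename`) would be indexed by Loki; the log lines themselves are stored as-is.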

Graylog

Sources of collected data:

Syslog (TCP/UDP)
GELF (send from app)
Kafka as transport queue
RabbitMQ
Reading from files: Graylog Sidecar https://docs.graylog.org/en/3.1/pages/sidecar.html#graylog-sidecar
and other cases https://docs.graylog.org/en/3.1/pages/sending_data.html
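
For the "GELF (send from app)" case listed above, an application builds a JSON message and sends it over the network. The sketch below assumes GELF version 1.1 and an uncompressed UDP datagram; the host name, the extra field and the address used in the comment are hypothetical, and the helper names are not part of any Graylog library.

```python
import json
import socket
import time


def gelf_message(host, short_message, level=6, **extra):
    """Build a GELF 1.1 message; additional fields are prefixed with an underscore."""
    message = {
        "version": "1.1",
        "host": host,
        "short_message": short_message,
        "timestamp": time.time(),
        "level": level,  # syslog severity, 6 = informational
    }
    for key, value in extra.items():
        if key == "id":
            raise ValueError("the _id field is reserved in GELF")
        message[f"_{key}"] = value
    return message


def send_gelf_udp(message, address):
    """Send one uncompressed GELF message as a single UDP datagram."""
    data = json.dumps(message).encode("utf-8")
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.sendto(data, address)


message = gelf_message("web-1", "user login failed", level=4, user="alice")
print(json.dumps(message))
# send_gelf_udp(message, ("graylog.example.org", 12201))  # hypothetical address of a GELF UDP input
```

Large messages would need chunking or compression, which this sketch leaves out.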

There is also evidence of Fluentd being used as a source for Graylog (e.g., https://blog.insiderattack.net/bunyan-json-logs-with-Fluentd-and-graylog-187a23b49540, https://www.devinitiate.com/managing-logs-with-graylog-Fluentd/).

Finally, Fluentd can be bound to Graylog through the GELF plugin (inspired by https://blog.insiderattack.net/bunyan-json-logs-with-Fluentd-and-graylog-187a23b49540).

Core Differences

Below is a comparison of various open-source log collectors according to resource requirements, supported sources and downstream destinations.

	Logstash	Beats	Graylog	Fluentd	Promtail
Platform	Java	Go	Java	Ruby	Go
Memory Consumption	high	low	high	medium	low
Alerting
Multiple Destinations

Log Sources Compatibility Map

Most common sources of logs, grouped by producers:

Applications
    1. Files
    2. Events over TCP
    3. Windows Eventlog
Linux OS packages/utilities
    1. Syslog over network
    2. Journald
Windows OS packages/utilities
    1. Windows Eventlog
Streams
    1. AMQP
    2. Kafka

Log Sources	Logstash	Beats*	Graylog	Fluentd	Promtail
Files
Unstructured events over TCP
GELF over network
Syslog over network
Journald
Windows Eventlog
Kafka
AMQP

* - Beats is a set of tools, rather than a single application

Representation Services Review

This section shows a comparison of various user-facing solutions used as an interface for searching and visualizing logs.

Solutions are compared according to these aspects: search syntax, visualization features, alerting and user management capabilities.

Kibana

Kibana is an open source frontend application that sits on top of the Elastic Stack, providing search and data visualization capabilities for data indexed in Elasticsearch. Commonly known as the charting tool for the Elastic Stack (previously referred to as the ELK Stack after Elasticsearch, Logstash, and Kibana), Kibana also acts as the user interface for monitoring, managing, and securing an Elastic Stack cluster — as well as the centralized hub for built-in solutions developed on the Elastic Stack.

https://github.com/elastic/kibana

Grafana Explore (Logs)

Grafana Explore allows you to dig deeper into your metrics and logs to find the cause.

Grafana's new logging data source, Loki, is tightly integrated into Explore and allows you to correlate metrics and logs by viewing them side-by-side.

Graylog

Detailed description of the Graylog representation aspect can be found by the links:

General Features

	Kibana	Graylog	Grafana - Loki
Query Language	Kibana Query Language (close to Lucene syntax)	Close to Lucene syntax	LogQL
Fuzzy Search	(Levenshtein distance)	(Damerau–Levenshtein distance)
Regex Search
Text Normalization
Transform message at display time		(Decorators)
Dashboards
Alerts	(Yelp/elastalert)
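
The two fuzzy-search distances named in the table differ in how they count swapped adjacent characters. The sketch below illustrates that difference only; it implements the restricted variant of Damerau-Levenshtein (optimal string alignment) and makes no claim about how either product computes its matches.

```python
def edit_distance(a: str, b: str, transpositions: bool = False) -> int:
    """Levenshtein distance; with transpositions=True, a swap of adjacent characters costs 1."""
    rows, cols = len(a) + 1, len(b) + 1
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i  # delete all characters of a[:i]
    for j in range(cols):
        d[0][j] = j  # insert all characters of b[:j]
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # substitution or match
            )
            if (transpositions and i > 1 and j > 1
                    and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]):
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)  # adjacent swap
    return d[-1][-1]


print(edit_distance("error", "erorr"), edit_distance("error", "erorr", transpositions=True))
```

For the common typo "erorr", plain Levenshtein needs two edits while the transposition-aware variant needs one, so a fuzzy query limited to one edit would match only under the latter.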

AuthN/AuthZ

	Kibana*	Graylog	Grafana - Loki
OAuth2
AD groups
AD users

* - setup without paid X-Pack extension

Additional Representation Services

This section lists services that provide a limited feature set or have weak community support (based on GitHub activity).

Grafana Log CLI

LogCLI allows users to run LogQL queries against a Loki server.

https://github.com/grafana/loki/blob/master/docs/getting-started/logcli.md

Banana

The Banana project was forked from Kibana, and works with all kinds of time series (and non-time series) data stored in Apache Solr.

https://github.com/lucidworks/banana

Cloudera Hue

Hue is a web-based interactive query editor that enables you to interact with data warehouses.

Summary

Various open-source log management components were reviewed. This work provides a structured description of their responsibilities, features and (inter-)dependencies.

The research is based on an analysis of product documentation. Links to official vendor websites and GitHub are provided wherever they were needed to support the highlighted component features.

We hope that this work will help engineers make educated decisions on Log Management System architecture and build efficient, versatile monitoring infrastructures.

References

[Mustafa, 2016] Akca, Mustafa; Aydoğan, Tuncay; Ilkuçar, Muhammer (2016). An Analysis on the Comparison of the Performance and Configuration Features of Big Data Tools Solr and Elasticsearch. International Journal of Intelligent Systems and Applications in Engineering, 4, 8-8. doi:10.18201/ijisae.271328.

Authors

Vitaliy Levashin, Mihail Klyuykov, Yuriy Chizhov
