Master Project Distributed Systems: Monitoring of Scientific Workflows

maseiler/swms-monitoring

Extended Metrics Project for Apache Airflow

Gain more insight into your scientific workflow management system and optimize workflow execution while minimizing costs.

Explore the project report for more detailed information »

View Demo »

About The Project

This repository belongs to the university project "Master Project: Distributed Systems - Monitoring of Scientific Workflows", held during the summer term 2022 at the Technical University Berlin. The goal of the project is to gain practical experience with so-called Scientific Workflow Management Systems (SWMS) and to extend existing ones with additional functionality. In our subproject we extend the monitoring capabilities of the SWMS Apache Airflow as follows:

  • use the extended Berkeley Packet Filter (eBPF) to collect low-level kernel-space information and process it to gain more insight into Airflow tasks

The project aims for a semi-realistic environment and is therefore set in the Kubernetes ecosystem, reflecting real-world scenarios with large workloads in highly distributed systems.

Getting Started

We try to keep deployment as simple as possible and therefore use Kubernetes and Helm to deploy a working prototype to the Google Cloud Platform.

To get a copy up and running, follow these steps.

Prerequisites

Deploy

Simply run ./setup.sh in the cloned repository directory.

Deprovision

Run ./deprovision.sh to remove all artifacts and confirm with y.

Usage

Once deployed, the important services are forwarded to localhost via kubectl and can be accessed at the following addresses:

The login credentials for the Airflow UI and Grafana are by default the following:

  • Username: pjds
  • Password: pjds
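
Before logging in, it can help to check that the forwarded services actually accept connections on localhost. The following is a minimal sketch, not part of the repository: the ports 8080 and 3000 are assumptions (the upstream defaults of the Airflow webserver and Grafana), and the names SERVICES, is_port_open and check_services are hypothetical. Replace the ports with the addresses your deployment reports.

```python
import socket

# Assumed local ports; these are the upstream defaults, not values taken
# from this project. Adjust them to the addresses of your deployment.
SERVICES = {
    "Airflow UI": 8080,  # assumption: Airflow webserver default port
    "Grafana": 3000,     # assumption: Grafana default port
}


def is_port_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False


def check_services(services: dict[str, int], host: str = "127.0.0.1") -> dict[str, bool]:
    """Map each service name to whether its local port accepts connections."""
    return {name: is_port_open(host, port) for name, port in services.items()}


if __name__ == "__main__":
    for name, ok in check_services(SERVICES).items():
        print(f"{name}: {'reachable' if ok else 'not reachable'}")
```

A service reported as not reachable usually means the port forwarding is not running or uses a different local port than assumed here.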

License

Distributed under the MIT License. See LICENSE for more information.
