Installation

YanaZhivkova edited this page Sep 13, 2021 · 44 revisions

Install SDK

Versatile Data Kit SDK (vdk) is the go-to tool for developing and running data jobs locally.

Prerequisites

Versatile Data Kit CLI (VDK) requires Python 3.7+. If you're new to Python, we recommend Anaconda.
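If you are unsure whether your interpreter meets the 3.7+ requirement, a small shell check can confirm it before you install. The sketch below is only an illustration: `check_python_version` is a hypothetical helper name, not part of VDK, and it assumes the version string has the usual `major.minor[.patch]` form.

```shell
# Hypothetical helper (not part of VDK): succeeds when the given
# Python version string is 3.7 or newer, fails otherwise.
check_python_version() {
    version="$1"
    major="${version%%.*}"   # text before the first dot, e.g. "3"
    rest="${version#*.}"     # text after the first dot, e.g. "10.4"
    minor="${rest%%.*}"      # first component of the remainder, e.g. "10"
    [ "$major" -gt 3 ] || { [ "$major" -eq 3 ] && [ "$minor" -ge 7 ]; }
}

# Example use (assumes python3 is on your PATH):
#   check_python_version "$(python3 -c 'import platform; print(platform.python_version())')" \
#       && echo "Python version is supported"
```

Comparing the major and minor components as integers avoids the usual pitfall of string comparison, where "3.10" would sort before "3.7".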

Installation

pip install quickstart-vdk

This will install VDK with support for some common databases and job lifecycle management operations.

Run the help command to see what you can do:

vdk --help

Check out Getting Started to create your first data job.

Optional features

VDK comes with a number of plugins that can be installed to change or extend its behaviour.

pip install vdk-plugin-name

See the plugins page for more info.

Install Versatile Data Kit Control Service

Versatile Data Kit includes a Control Service server that enables deploying, managing, and monitoring data jobs.

Install locally

Prerequisites

Then run:

vdk server --install

This will install the Control Service on your local machine and configure the local installation of vdk to use it.

Install in Production

In production, use the Helm chart to install and configure the Control Service.

Prerequisites

  • Install helm
  • Kubernetes

Then run:

helm repo add vdk-gitlab https://gitlab.com/api/v4/projects/28814611/packages/helm/stable
helm install my-release vdk-gitlab/pipelines-control-service 

Read more about the installation in the Versatile Data Kit Control Service Chart GitHub repository.

Use

# run help to see what you can do and start playing around
vdk --help
# for example create a new job:
vdk create -u <URI-YOU-GOT-FROM-helm-install-OUTPUT>