
dpfister22/LL-data-pipeline-automation-with-github-actions

 
 


Data Pipeline Automation with GitHub Actions

This is the repository for the LinkedIn Learning course Data Pipeline Automation with GitHub Actions. The full course is available from LinkedIn Learning.


In this course, learn how to set up workflows on GitHub Actions to automate processes with both R and Python. Instructor Rami Krispin takes you through the automation process, sharing real-world examples. He shows you how to set up a data pipeline, pull metadata from a pipeline, and deploy a live dashboard with GitHub Actions and Pages. If you’re tired of spending hours running scripts manually, or slowing down your workflow by pulling data from APIs or updating dashboards, join Rami in this course to see how automation can speed up your work.

See the readme file in the main branch for updated instructions and information.

Instructions

Fork this repo to follow along with the course. The course has tracks for R and Python users, and you can choose to follow one or both. The R code examples are available under the R folder; likewise, the Python code examples are available under the python folder.

This repo includes VS Code settings to launch the repo inside a Docker container using the Dev Containers extension. The course image was built for the amd64 CPU architecture, in line with GitHub Actions container support. Alternatively, you can install the required R or Python packages locally, using required_packages.R for R and requirements.txt for Python.

Throughout the course, we will work with the EIA API to pull data and metadata. The EIA API is open and free, but an API key is required to access it. To register for the API and set your key, go to the EIA website and follow the registration instructions.
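As a rough illustration of how a key might be used once you have one, the sketch below reads the key from an environment variable and builds a request URL. The variable name `EIA_API_KEY`, the helper names, and the example route are assumptions made for this sketch and are not taken from the course code; the base URL is assumed to be the EIA API v2 endpoint.

```python
import os
from urllib.parse import urlencode

# Assumed base URL for version 2 of the EIA API.
EIA_BASE_URL = "https://api.eia.gov/v2/"


def get_api_key(env_var: str = "EIA_API_KEY") -> str:
    """Read the API key from an environment variable (the name is an assumption)."""
    key = os.environ.get(env_var)
    if not key:
        raise RuntimeError(f"Set the {env_var} environment variable first")
    return key


def build_request_url(route: str, api_key: str, **params: str) -> str:
    """Join the base URL, a route, and query parameters into one request URL."""
    query = urlencode({"api_key": api_key, **params})
    return f"{EIA_BASE_URL}{route.strip('/')}/?{query}"


if __name__ == "__main__":
    # Hypothetical route, used only to show the shape of the URL.
    url = build_request_url(
        "electricity/rto/region-data/data",
        get_api_key(),
        frequency="hourly",
    )
    print(url)
```

Keeping the key in an environment variable rather than in the script means the same code can run locally and on GitHub Actions, where the key would typically be supplied as a repository secret.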

For learning purposes, we store the data pipeline outputs and metadata locally, in the csv and metadata folders.
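A minimal sketch of that storage layout might look like the following. Only the folder names `csv` and `metadata` come from the text above; the function name, the file names `data.csv` and `log.json`, and the metadata fields are hypothetical and chosen only for illustration.

```python
import csv
import json
from datetime import datetime, timezone
from pathlib import Path


def save_outputs(rows, base_dir="."):
    """Write rows to csv/data.csv and a small summary to metadata/log.json.

    `rows` is a non-empty list of dicts that share the same keys.
    """
    if not rows:
        raise ValueError("rows must contain at least one record")

    base = Path(base_dir)
    csv_dir = base / "csv"
    meta_dir = base / "metadata"
    csv_dir.mkdir(parents=True, exist_ok=True)
    meta_dir.mkdir(parents=True, exist_ok=True)

    columns = list(rows[0].keys())

    # Pipeline output: one CSV file with a header row.
    data_path = csv_dir / "data.csv"
    with data_path.open("w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=columns)
        writer.writeheader()
        writer.writerows(rows)

    # Pipeline metadata: when the refresh ran and what it wrote.
    metadata = {
        "updated_at": datetime.now(timezone.utc).isoformat(),
        "rows": len(rows),
        "columns": columns,
    }
    meta_path = meta_dir / "log.json"
    meta_path.write_text(json.dumps(metadata, indent=2))
    return data_path, meta_path
```

Calling `save_outputs([{"period": "2024-01-01T00", "value": 1}])` from the repo root would create both folders if they do not exist yet and overwrite the two files on each run.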

Instructor

Rami Krispin

Senior Manager, Data Science and Engineering

Check out my other courses on LinkedIn Learning.

