Skip to content

Latest commit

 

History

History
126 lines (75 loc) · 7.49 KB

File metadata and controls

126 lines (75 loc) · 7.49 KB

Useful resources for Data Engineers

Being a good specialist means that you need to be familiar with the latest trends in your field of work. Especially if we are talking about Data Engineering and Data Science, those fields are changing rapidly nowadays.

In this article I collected a list of handpicked resources I'm following in order keep up with the pace.

Newsletters

Newsletters are a great way of being informed about news without spending too much time.


First very nice newsletter is The Data Science Roundup by Tristan Handy, founder Fishtown Analytics and dbt tool . "The internet's most useful data science articles". One of my favorite resource from all presented here. Tristan share a couple of useful links each week on data science and data engineering topics.


Second newsletter is Counting Stuff by Randy Au. Randy writes his own thoughts on different aspects of data. Sometimes emails are more technical, sometimes less. But overall it gives me a lot of food for thoughts.


Data Engineering Weekly by Ananth Packkildurai. Relatively new newsletter, started in July 2020.

Ananth shares a lot of high quality links about different aspects of data engineering. What I like most about this newsletter is that it covers not only technical topics, like tools or technologies, but also some higher level topics like data quality, data discovery, etc.


Seattle Data Guy by Benjamin Rogojan. This newsletter takes another approach to content. Each issue is a separate story where author shares his ideas about the topic. Just look at this one about Data Strategy for 2021 or this one about ETL.


Another gem is DataElixir newsletter. Covers a lot of data topics, from Data Science to Data Engineering. Absolutely recommend to stay on the edge of latest news in data.

Blogs

Blogs require more time to spend on them, but overall they give you more detailed information and even step-by-step guides.


Start Data Engineering is a relatively new blog, but provides very quality content. You can read about some high-level topics like What is a Data Warehouse or Why to use Staging as well as about more practice-oriented tutorials like How to pull data from API using AWS Lambda.


Andreas Kretz is a data engineer from Germany who posts on Medium, writes his own Data Engineering book and have a YouTube channel. Talk about different topics of DE and DS.


Locally Optimistic writes about many aspects of data, including data engineering. Overall a good resource to rest from strictly technical articles and switch to soft-skill and more philosophical topics.


My recent finding is Alisa in Techland blog. Alisa has some nice in-depth articles about Data Engineering. Her motto is "I believe that data engineering can and should be more approachable". And I can't agree more :)

Lastly, pay attention to the blogs of some tech companies. Don't hesitate to check them all:

Videos

Here I will recommend some YouTube channels worth checking. Some of them are pure experience, some are nice guides and tutorials.


Data Council has "the best data-oriented tech talks from the top engineers from top technology companies across the world". The content sometimes "heavy" for beginners, but more seasoned developers with find it very useful.


Databricks is a company behind Apache Spark computing framework and some other data-related technologies. The channel has many videos about Big Data, Machine Learning and AI. Their headline conference "Data + AI summer 2020" was purely online so you can check a lot of good talks from smartest people. Must see if you like to know more about Hadoop technilogies.


PyCon and PyData are two conferences about Python programming language. First one is more about engineering, second has a strong data shift.

PyCon has no dedicated channel, each edition of the conference lives on a separate channels, for example:

For PyData there is a dedicated channel here.

Podcasts

Sometimes you don't have possibility to read something, or you want to take a break from the monitor. For you I have another medium – podcasts :)


Data Engineering Podcast. Discussions about various data-related topics with interesting guests.


Databricks started its own video-podcast called Data Brew. So far, covered topics include Data lakes, Delta lakes, BI on Delta lakes and Spark (basically everything that Databricks is working on). So if you are a Big Data engineer you definitely find those podcasts useful.

Awesome lists

Lastly, when you want to discover something new, unexpected or inspirational, try some of the Awesome lists below:


Awesome