Skip to content

Latest commit

 

History

History
13 lines (11 loc) · 691 Bytes

README.md

File metadata and controls

13 lines (11 loc) · 691 Bytes

BigDataProject

This repository is for collaboration of our big data project within the "Big Data" course. We used Spark to analyse spotify top100 dataset. The questions we are trying to answer are as below:

  1. Which artist has the most top-rankings?
  2. Who is the most popular artist in the respective regions?
  3. Which song stays longest in the top-ranking?
  4. Which song is on the top 50 list but never on the top 10?
  5. Which song has the highest streams in the last two years?
  6. How long time does a top ranking song takes to get to other countries?
  7. In which region do the top 10 change the most?
  8. For which artists is the variance of streams (per day) the lowest?