faaany/big-data-project

BigDataProject

This repository hosts our collaborative project for the "Big Data" course. We used Spark to analyse the Spotify top100 dataset. The questions we are trying to answer are:

  1. Which artist has the most top rankings?
  2. Who is the most popular artist in each region?
  3. Which song stays in the top ranking the longest?
  4. Which song is on the top 50 list but never in the top 10?
  5. Which song has the most streams in the last two years?
  6. How long does a top-ranking song take to reach other countries?
  7. In which region does the top 10 change the most?
  8. Which artists have the lowest variance in daily streams?
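
To make a couple of these questions concrete, here is a minimal sketch in plain Python rather than Spark, so that it runs without a cluster. It is not the project's implementation. The row layout `(track, artist, region, date, position, streams)` and the sample rows are hypothetical, and the real dataset schema may differ. The sketch also assumes one reading of question 1 (counting chart entries at position 1) and treats question 4 across all regions combined.

```python
from collections import defaultdict

# Hypothetical sample rows: (track, artist, region, date, position, streams).
# The column layout and the values are assumptions made for illustration only.
rows = [
    ("Song A", "Artist X", "us", "2017-01-01", 1, 1000),
    ("Song A", "Artist X", "us", "2017-01-02", 2, 900),
    ("Song B", "Artist Y", "us", "2017-01-01", 12, 400),
    ("Song B", "Artist Y", "us", "2017-01-02", 35, 300),
    ("Song C", "Artist X", "de", "2017-01-01", 60, 100),
]


def top50_never_top10(rows):
    """Question 4: tracks whose best position is in the top 50 but never in the top 10."""
    best = {}
    for track, _artist, _region, _date, position, _streams in rows:
        # Keep the best (lowest) position seen for each track.
        best[track] = min(position, best.get(track, position))
    return sorted(track for track, pos in best.items() if 10 < pos <= 50)


def number_one_entries(rows):
    """Question 1 (one possible reading): chart entries at position 1 per artist."""
    counts = defaultdict(int)
    for _track, artist, _region, _date, position, _streams in rows:
        if position == 1:
            counts[artist] += 1
    return dict(counts)


print(top50_never_top10(rows))
print(number_one_entries(rows))
```

In Spark, the same logic would typically be a group-by on the track or artist column followed by an aggregation and a filter; the per-track minimum and the per-artist count above map directly onto those steps.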
