Sven-Bo/data-analysis-python

Solve Real-World Data Science Tasks in Python | Data Analysis with Pandas & Plotly (Full Tutorial)

In this tutorial, we will work through a real-world data analysis project with Python.

We will be using the following Python libraries:

  • Pandas
  • Pandas Profiling Report
  • AutoViz
  • Plotly

After loading the dataset, we will do some initial exploratory data analysis to get a feel for the data. I am going to show you a set of useful pandas functions that you can apply to almost any dataset you work with.
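As a rough illustration of that first look at a dataset, here is a minimal sketch. The tutorial does not list the exact functions at this point, so the calls below are simply common pandas choices, and the small DataFrame (columns `order_id`, `region`, `sales`) is made up for the example rather than taken from the project files.

```python
import pandas as pd

# Hypothetical sample data; the real tutorial dataset is not shown in this README.
df = pd.DataFrame({
    "order_id": [1, 2, 3, 4, 5],
    "region": ["North", "South", "North", None, "West"],
    "sales": [250.0, 100.0, None, 400.0, 150.0],
})

# First rows, column dtypes, and summary statistics
print(df.head())
df.info()
print(df.describe())

# Missing values per column
missing = df.isna().sum()
print(missing)

# Number of distinct non-null values per column
unique_counts = df.nunique()
print(unique_counts)

# Frequency of each category in a single column
region_counts = df["region"].value_counts()
print(region_counts)
```

With your own data, you would replace the hand-built DataFrame with a call such as `pd.read_excel(...)` or `pd.read_csv(...)`.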

Beyond plain pandas, there are now libraries that make exploratory data analysis much easier. I will show you my two favorites, which generate automated reports in just a few lines of code. Those reports are a great starting point before we move on to answering real-world business questions.
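To give an idea of what "a few lines of code" can look like, here is a sketch for the two report libraries from the list above, pandas-profiling and AutoViz. The function names `build_profile_report` and `run_autoviz` and the output file name are assumptions made for this example. The imports sit inside the functions so that the snippet can be loaded even when one of the libraries is not installed.

```python
import pandas as pd


def build_profile_report(df: pd.DataFrame, output_path: str = "profile_report.html") -> str:
    """Write a pandas-profiling HTML report for df and return the file path."""
    # Imported here so the module loads without pandas-profiling installed.
    from pandas_profiling import ProfileReport

    profile = ProfileReport(df, title="Dataset Profile")
    profile.to_file(output_path)
    return output_path


def run_autoviz(df: pd.DataFrame):
    """Let AutoViz pick and draw charts for df; returns AutoViz's result."""
    # Imported here so the module loads without AutoViz installed.
    from autoviz.AutoViz_Class import AutoViz_Class

    av = AutoViz_Class()
    # filename is left empty because the data is passed directly via dfte.
    return av.AutoViz(filename="", sep=",", depVar="", dfte=df, header=0, verbose=0)
```

Calling `build_profile_report(df)` on a loaded DataFrame should produce an HTML file you can open in the browser. Both calls can take a while on larger datasets.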

While answering those questions, we will cover a wide range of pandas functions. We will also write our own Python helper function, which we will use in the deep-dive and visualization section. All the charts we create will be interactive and have a clean design.

We will cover the following chart types:

  • Histogram
  • Box Plot
  • Bar Charts
  • Scatter Plot
  • Line Chart

Feel free to code along with me. In the project files, you will also find an exercise notebook that includes all the tasks we are going to solve.

Video Tutorial

YouTube Video

Requirements

autoviz==0.0.81
numpy==1.19.3
openpyxl==3.0.5
pandas==1.2.0
pandas-profiling==2.9.0
plotly==4.14.1
plotly-express==0.4.1
xlrd==2.0.1

🤓 Check Out My Excel Add-ins

I've developed some handy Excel add-ins that you might find useful:

  • 📊 Dashboard Add-in: Easily create interactive and visually appealing dashboards.
  • 🎨 Cartoon Charts Add-In: Create engaging and fun cartoon-style charts.
  • 🤪 Emoji Add-in: Add a touch of fun to your spreadsheets with emojis.
  • 🛠️ MyToolBelt Add-in: A versatile toolbelt for Excel, featuring:
    • Creation of Pandas DataFrames and Jupyter Notebooks from Excel ranges
    • ChatGPT integration for advanced data analysis
    • And much more!

🤝 Connect with Me

☕ Support

If you appreciate the project and wish to encourage its continued development, consider supporting my work on Ko-fi.

Feedback & Collaboration

For feedback, suggestions, or potential collaboration opportunities, reach out at [email protected].
