Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

EDA-StackOverflowSurvey2023- Add new learning #1085

Closed
4 tasks
Charvi-14 opened this issue Oct 2, 2024 · 4 comments · Fixed by #1089
Closed
4 tasks

EDA-StackOverflowSurvey2023- Add new learning #1085

Charvi-14 opened this issue Oct 2, 2024 · 4 comments · Fixed by #1089
Assignees

Comments

@Charvi-14
Copy link
Contributor

Stack Overflow Survey 2023 - EDA Guide

Description

To add learning material that guides users on how to perform an Exploratory Data Analysis (EDA) on the Stack Overflow Survey 2023 data. This will help users understand the data better and improve their analysis skills. The material covers key aspects such as:

  • Understanding Data Structure:
    How to load and explore the dataset to get insights into the variables, patterns, and distributions.

  • Data Cleaning and Preparation:
    Identifying missing values, outliers, and inconsistencies, and introducing techniques to clean the data effectively.

  • Feature Engineering and Selection:
    Methods for selecting the most relevant variables to improve analysis or model performance.

  • Visualization Techniques:
    Covering different types of plots (bar plots, scatter plots, heatmaps, etc.) and how to interpret them to find trends and relationships.

  • Handling Correlations:
    Techniques for understanding and dealing with correlations between features.

  • Domain-Specific Insights:
    Focused analysis on relevant trends in the developer community, such as technology preferences, geographical trends, and developer satisfaction.

This will improve the overall user experience when analyzing the survey data and help users make more data-driven decisions.


Tasks

  • Create a comprehensive EDA guide for Stack Overflow Survey 2023.
  • Add examples of common plots and how to interpret them.
  • Split Multiselect Column into Multiple Boolean Columns
  • Update the README with a section on how to use the new EDA material.

Assign To

Charvi Arora

Copy link

github-actions bot commented Oct 2, 2024

Thanks for creating the issue,Please read the Pinned issued first and Readme.md in each Pull Request you made. Keep learning...

@Charvi-14
Copy link
Contributor Author

I am a GGSOC24 Extd. Contributor
I would appreciate if u allow me to add value to the ML-CaPsule
Understanding data is the first step in ML

@Charvi-14
Copy link
Contributor Author

Charvi-14 commented Oct 4, 2024 via email

@Niketkumardheeryan
Copy link
Owner

Should i start work on the issue?

On Wed, Oct 2, 2024 at 5:08 PM github-actions[bot] @.> wrote: Thanks for creating the issue,Please read the Pinned issued first and Readme.md in each Pull Request you made. Keep learning... — Reply to this email directly, view it on GitHub <#1085 (comment)>, or unsubscribe https://github.com/notifications/unsubscribe-auth/AZTLHHF7454QAXEZ5EWRRNLZZPLK5AVCNFSM6AAAAABPHPRWKWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGOBYGQZTANJXHE . You are receiving this because you authored the thread.Message ID: @.>

You can start working

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants