Skip to content

Dataset presenting a few results of the yearly Stack Overflow Developer Survey. Emphasis on salary and job satisfaction issues.

Notifications You must be signed in to change notification settings

HH2805/Salaries-and-Job-Satisfaction-of-Data-Professionals-Worldwide

Repository files navigation

Visualizing Salaries and Job Satisfaction of Data Professionals Worldwide

Overview

The goal of this project is to practice data cleaning, creating and interpreting different types of visualizations, and to present all of it effectively.

I have chosen to study the results of the Stack Overflow worldwide yearly 'Developer Survey'. My dataset is an extract of the 2017, 2018, 2019 and 2020 survey results. It was downloaded from Kaggle.

Each row of the dataset is the response of one data professional with regard to country, job position, salary, job satisfaction, and more.

The initial dataset had 33,600 observations (respondents from an initial set of 177 countries) and 14 features. I have downssized it to 22,660 rows/respondents and 8 features, mainly by:

  • reducing the scope of the response panel to those countries which are represented in the 4 yearly editions of the survey with at least 30 respondents each year;
  • discarding all respondents who have not provided their salary details.

Findings

I have studied: 1/ the relationship between salary and other features such as country and job position. My assumption that the main connection would be between salary and country turned out to be true. 2/ connections between job satisfaction and salary or job position; this told me there is NO strong connection between them.

Also I have calculated confidence intervals for the mean salary of the general population of US data professionals. I have used the stats.t.interval method from Python. Finally I have assumed a $105,000 mean salary for all US data professionals and have used the one-sample ttest method to test this assumption. The test showed that a $105,000 salary is a credible assumption.

Technical info / Deliverables

The dataset can be found at https://www.kaggle.com/phuchuynguyen/salary-and-moredata-scientist-analyst-engineer I have used Python to prepare the data and for statistical testing. I have used Tableau Public for visualizing and interpreting data. A github repo has been created specifically for this project, and can be found at https://github.com/HH2805/Data-Professionals---m2-final-project The repo includes:

About

Dataset presenting a few results of the yearly Stack Overflow Developer Survey. Emphasis on salary and job satisfaction issues.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published