Skip to content

Latest commit

 

History

History
49 lines (30 loc) · 1.81 KB

README.md

File metadata and controls

49 lines (30 loc) · 1.81 KB

StackOverflow Survey 2019 & 2020 Analysis

Table of Contents

  1. Installation
  2. Project Motivation
  3. File Descriptions
  4. Results
  5. Licensing, Authors, Acknowledgements

Installation

  • For this analysis, I had used two datasets from StackOverflow survey (2019 and 2020) to compare and see the trends over a couple of years. Please find the link to download StackOverflow dataset here

  • The libraries required to run this project are:

    pandas

    numpy

    matplotlib

    seaborn

    pandas_profiling

Project Motivation

I took this dataset for my project as I was interested in understanding and answering the following questions:

  1. Which languages saw the rise in their usage the most from 2019-2020?
  2. Has the deciding factor 'Remote Work / Work from home' has increased over the years and due to the pandemic?
  3. When did most people write their first line of code?
  4. Which group of people use stackoverflow for their work the most?
  5. What factors associate with high salary?

File Descriptions

stackoverflowsurvey-analysis.ipynb: This is notebook where I explore, wrangle the data and try to extract the answers to the questions above. I have added comments for a better understanding of the thought process for my individual steps.

Results

The main findings of the code can be found at the post available on Medium post here.

Licensing, Authors, Acknowledgement

Thanks to Stack Overflow for providing the dataset