Skip to content

CodeClan's dirty data project involving cleaning and tidying (very) messy data. Includes analysis insight report

Notifications You must be signed in to change notification settings

LenkaRo/dirty_data_project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 

Repository files navigation

dirty_data_project

CodeClan's dirty data project involving cleaning and tidying (very) messy data.

Task 4 - Halloween Candy

We are dealing with data obtained from three independent surveys.

These surveys were carried on in years 2015, 2016 and 2017.

Respondents from all over the world were ask to rank various candies based on their subjective opinion.

They were offered three options as an answer - joy, despair, or meh

There were also additional questions like what was the respondents' age, what was their country of origin, whether they prefer name Betty or Veronica etc. These questions had no default set of answers and it was up to the respondents to type in whatever they found appropriate.

Some candy types, as well as additional questions, did only appear in one or two surveys.

As a result, the raw data entering the analysis differed significantly and required some deep cleaning first.

More information on the data can be found here.

The project structure:

Data cleaning

Data analysis

Both files are thoroughly commented on what steps have been taken.

About

CodeClan's dirty data project involving cleaning and tidying (very) messy data. Includes analysis insight report

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published