dirty_data_project

CodeClan's dirty data project involving cleaning and tidying (very) messy data.

Task 4 - Halloween Candy

We are dealing with data obtained from three independent surveys.

These surveys were carried out in 2015, 2016 and 2017.

Respondents from all over the world were asked to rate various candies based on their subjective opinion.

They were offered three possible answers: joy, despair, or meh.

There were also additional questions, such as the respondent's age, their country of origin, or whether they prefer the name Betty or Veronica. These questions had no predefined set of answers, so respondents could type in whatever they found appropriate.
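Free-text answers like these usually need normalising before they can be analysed. The following is a minimal sketch in base R, not the code from the project's cleaning script: the sample answers, the plausible age range of 1 to 120, and the `lookup` table are all assumptions made up for illustration.

```r
# Hypothetical free-text answers, invented for illustration only
raw <- data.frame(
  age     = c("34", "forty", "27.0", "old enough", "350"),
  country = c("USA", "usa ", "United States", "canada", "U.K."),
  stringsAsFactors = FALSE
)

# Age: anything that is not a number becomes NA
age_num <- suppressWarnings(as.numeric(raw$age))
# Assumed plausible range; values outside it are treated as missing
age_num[!is.na(age_num) & (age_num < 1 | age_num > 120)] <- NA

# Country: trim whitespace, lower-case, strip punctuation,
# then map the known spelling variants to one canonical name
key <- gsub("[[:punct:]]", "", tolower(trimws(raw$country)))
lookup <- c(
  "usa"           = "USA",
  "united states" = "USA",
  "canada"        = "Canada",
  "uk"            = "UK"
)
country_clean <- unname(lookup[key])  # unknown variants would become NA

clean <- data.frame(age = age_num, country = country_clean)
print(clean)
```

A lookup table keeps every recoding decision visible in one place, which helps when new spelling variants turn up in the data.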

Some candy types, as well as some of the additional questions, appeared in only one or two of the surveys.

As a result, the three raw data sets differed significantly from one another and required some deep cleaning before analysis.
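One way to stack surveys whose columns only partly overlap is sketched below. It assumes the dplyr package is installed. The three tiny tibbles and their column names are hypothetical stand-ins, and the project's script may combine the data differently.

```r
library(dplyr)

# Hypothetical miniature surveys; the column names are invented
candy_2015 <- tibble(age = c(30, 41), butterfinger = c("joy", "meh"))
candy_2016 <- tibble(age = 25, country = "Canada", butterfinger = "despair")
candy_2017 <- tibble(age = 52, country = "USA", twix = "joy")

# bind_rows() matches columns by name and fills the gaps with NA;
# .id records which survey each row came from
combined <- bind_rows(
  `2015` = candy_2015,
  `2016` = candy_2016,
  `2017` = candy_2017,
  .id = "year"
)

print(combined)
```

Columns missing from a survey, such as `country` in the hypothetical 2015 data, come out as `NA` for that survey's rows, so nothing is lost when the data frames are stacked.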

More information on the data can be found here.

The project structure:

Data cleaning

data_cleaning_script_task_4.R - combines the three data frames and saves the cleaned data to a candy_clean.csv file

Data analysis

candy_analysis.Rmd - reads in the candy_clean.csv file and carries out the analysis

Both files are thoroughly commented to explain each step taken.
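The hand-off between the two files can be sketched as a CSV round trip followed by a simple count. This is an illustration in base R only: the sample data, the temporary file location, and the lower-case rating labels are assumptions, not the contents of the real candy_clean.csv.

```r
# Hypothetical cleaned data, standing in for the real candy_clean.csv
candy <- data.frame(
  year         = c(2015, 2015, 2016, 2017),
  butterfinger = c("joy", "meh", "despair", NA),
  twix         = c(NA, NA, "joy", "joy"),
  stringsAsFactors = FALSE
)

# Cleaning step: write the cleaned data to disk (a temp dir is used here)
path <- file.path(tempdir(), "candy_clean.csv")
write.csv(candy, path, row.names = FALSE)

# Analysis step: read the file back in and count "joy" ratings per candy
candy_read  <- read.csv(path, stringsAsFactors = FALSE)
rating_cols <- setdiff(names(candy_read), "year")
joy_counts  <- sapply(
  candy_read[rating_cols],
  function(x) sum(x == "joy", na.rm = TRUE)
)

print(joy_counts)
```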


LenkaRo/dirty_data_project
