Skip to content

bpanda-dev/DS-200-Research-Methods-Plot-Assignment

Repository files navigation

DS 200 Research Methods Plot Assignment

This Repository is for the credit requirement of DS 200: Research Methods. All the plots in this repository was made possible with the data from https://data.gov.in A Digital India Initiative. All the data used here is only relevant to India.

Zip file with all the Plots and Codes DS_200_Research_Methods_Plot_Assignment.zip

Bar plot

Data description:

State/UT wise Accidents Victims Killed according to classification of Age and Sex during 2017 [1]

This data contains state wise statistics about number of fatal road accidents that happened in the year 2017. The data is further categorized by Age groups and Sex.

Below is the screen shot of the columns in the data.

Dataset 1 alt text

Plot:

Below is the plot comparing number of road accidents of different Age groups and between different Sex.

alt text

Observations:

  1. The number of men killed in road accidents is more than number of women killed for every age group.(This could be due to the fact that there are more male drivers than female drivers.)
  2. By carefully observing the two largest age groups 18-25 age group and 25-35 age group, we can conclude that the Average number of Accidents per "year age" in the age group 18-25 (Male:30148/7 = 4307 | Female: 4096/7 = 585) is the greater than the age group 25-35 (Male: 34728/10 = 3473 | Female: 4821/10 = 482). Infact it is the highest among all age groups, regardless of the gender
  3. So,we can say younger inexperienced drivers are the cause for higher rate of accidents across both genders.

Box plot

Data description:

State/UT wise Per Capita Consumption of Petroleum Products from 2012-13 to 2018-19

This data contains state/UT wise statistics about consumption of Petroleum Products from 12-13 to 18-19.

Below is the screen shot of the columns in the data.

Dataset 2

alt text

Note: Telangana data is not available for 2012-13 and 2013-14 because it was formed in June 2014.So I copied Andhra Pradesh's data into it.

Plot:

Below is the Box plot comparing the trends of Petroleum Products usage in our country from 2012-13 to 2018-19.

alt text

Observations:

  1. The average usage of petroleum products in our country has increased by a very thin margin from 2012 to 2018.
  2. Observe the two farthest outliers for each Year, they are the union Territories, Dadar and Nagar Havelli(farthest) and Daman and Diu (closer).For these two union territories, the per capita in Kg usage of petroleum products has almost close to doubled from 706(2012) to 1160(2018) for Dadar and Nagar Havelli and 532(2012) to 910(2018) for Daman and Diu.
  3. Apart from 'Dadar and Nagar Havelli' and 'Daman and Diu' , there are two more union territories which also are outliers but not to much extent.Those are Goa and Puducherry.

Scatter plot

Data description:

Bio-chemical (CAB) 2014 Survey data of the district Jajapur (Odisha)

This data contains medical data of the people living in the Jajapur district of Odisha.I have used this data to see the correlation between age and BMI for the people in that district.

I have removed all the entries which had Nan in their Height or Weight catergory. I have removed all the entried which had Weights and lenght calculated for unborn babies. I have removed some of the columns which I didn'nt need for calculating the BMI. I converted the Height in centimeters to meters and calculated the BMI by the formula BMI = Weight in Kg / (ht in mtrs)^2

Below is the screen shot of the columns in the data.The columns BMI and Height_m are calculated and added by me.

Dataset 3

alt text

Plot:

Below is the scatter plot of Age vs BMI .Aim is to find any correlation or any age group prone to obese or malnutrition.

Note: The ranges for BMI classification in the graph legends, is taken from [4].

alt text

Observations:

  1. Upon careful observation, its alarming to see that almost all the people aged below 18 to 20 i.e. children are underweight(light blue region).This suggests that malnourishment in children is a major issue in that district.
  2. People from age 30 to 40 are more likely to be over weight or have Obesity.It is the age range(30-50) where the average BMI increases like a camel hump and then again decreases.

References

[1] [State/UT-wise Accidents Victims Killed according to classification of Age and Sex during 2017] (https://data.gov.in/resources/stateut-wise-accidents-victims-killed-according-classification-age-and-sex-during-2017)

[2] [State/UT-wise Per Capita Consumption of Petroleum Products from 2012-13 to 2018-19] (https://data.gov.in/resources/state/ut-wise-per-capita-consumption-of-petroleum-products-from-2012-1)

[3] [Annual Health Survey : Clinical, Anthropometric & Bio-chemical (CAB) Survey] (https://odisha.data.gov.in/resources/clinical-anthropometric-bio-chemical-cab-2014-survey-data-district-jajapur-odisha#web_catalog_tabs_block_10)

[4] https://www.nhlbi.nih.gov/health/educational/lose_wt/BMI/bmicalc.htm

About

This repository is for the DS 200 plotting assignment

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published