Code and Description for Visualizations for the Master in Computational Biology Statistic Data Analysis and Visualization course in 2020 Spain Mortality INE datasets(2022)
Authors : Lucía Martín Fernandez, Ana Solbas Casajús, Ángela Gómez Sacristán and Natalia García Sánchez
Data corresponds to the following datasets and come from the following corresponding sources:
Filename | Description | Used in code | Source |
---|---|---|---|
Provincias_datos.csv |
Mortality rate in Spanish provinces (2020) - manually preprocessed | BarplotsINEDeaths.Rmd , mortalityRatePerYear.Rmd , Spain_maps.Rmd links |
datos.gob.es |
crudeMortality.csv |
Crude mortality rate (per year) | BarplotsINEDeaths.Rmd , mortalityRatePerYear.Rmd , Spain_maps.Rmd links |
INE |
mortalityRate.csv |
Mortality rate per year, age and sex | mortalityRatePerYear.Rmd links |
|
causas_muerte.csv |
Death causes from 2010 to 2020 | mortalityRatePerYear.Rmd links |
INE |
ccaa_provincia.csv |
File mapping Autonomous Community to Province - manually preprocessed | BarplotsINEDeaths.Rmd links |
INE |
CCAA_DR_Mortality_Population.csv |
Crude and per 100,000 Mortality rates per Autonomous Community in 2020 | Spain_maps.Rmd link |
crafted in BarplotsINEDeaths.Rmd |
In addition, there are three R markdown files that have been used to generate the visualizations for the assignment.
BarplotsINEDeaths.Rmd
: script used to generate the barplot and boxplots charts.Spain_maps.Rmd
: script used to generate the Spain map plots containing information about the 2020 death rates by province and Autonomous Community.mortalityRatePerYear.Rmd
: script used to generate the time series plots for death rate and life expectancy in Spain as well as the death rate per age and sex in 2020 barplot.
Dependencies : mapSpain, reshape2, ggplot2, sf, dplyr, tidyr, tidyverse should be preinstalled in an R environment