Skip to content

nathanaelhub/DMA-Final

Repository files navigation

DMA-Final

Data Mining and Analysis Final Project: Nathan Matheis, Nathanael Johnson & Jeffrey Kendig

Dataset Link: https://data.cdc.gov/Case-Surveillance/COVID-19-Case-Surveillance-Public-Use-Data/vbim-akqf

Files Include:

covid_data_cleaner.ipynb Removes all rows with missing data and removes unwanted columns for the given initial dataset

covid_cleaned.csv Cleaned version of the original dataset created by covid_data_cleaner.ipynb

covid_kmeans.ipynb Performs a k-means analysis on the covid_cleaned.csv file, producing a scatterplot with clusters

covid_knn.ipynb Performs a k nearest neighbors analysis on the covid_cleaned.csv file, producing a confusion matrix and a classification report

Covid_Naive_Baise.ipynb Performs a Naive Bayes analysis on the covid_cleaned.csv file, producing a prediction on if a given individual will survive contgracting covid

Covid_visualize.ipynb Produces varius charts and figures based on the data provided by the covid_cleaned.csv file

About

Data Mining & Analysis Final

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published